INDUCTION OF CTL RESPONSES TO FOREIGN
ANTIGENS EXPRESSED IN MYCOBACTERIA 

This invention relates to the induction of a T-cell 
response, in particular a cytotoxic T lymphocyte response. More 
particularly, this invention relates to the induction of CTL 
responses to proteins or polypeptides expressed by recombinant 
mycobacteria. 

Cell-mediated immunity (CMI) is thought to be a major line of
defense against certain infections, such as viral infections and
certain bacterial infections. For example, CMI may be significant
in the development of an effective vaccine against human
immunodeficiency virus (HIV), or AIDS virus, because HIV vaccines
and/or therapies based on the generation or passive transfer of
HIV-specific antibody in the absence of cell-mediated immunity
have not yielded consistent protection in primates challenged
with HIV. Thus, interest has turned to the induction of
cell-mediated responses to various infections, such as, for
example, HIV infection, to the identification of proteins or
polypeptides that stimulate a cytotoxic T lymphocyte response,
and to methods of administering such proteins or polypeptides.

In accordance with an aspect of the present invention, there 
is provided a method of inducing a CTL response in an animal 
comprising administering to the animal mycobacteria transformed 
with at least one DNA sequence which encodes a protein or peptide 
or fragment or derivative thereof which includes an epitope which
is recognized by cytotoxic T lymphocytes. The mycobacteria are
administered in an amount effective to induce a CTL response in
the animal.

Proteins or polypeptides which the at least one DNA sequence
may encode include, but are not limited to, Mycobacterium leprae
antigens; Mycobacterium tuberculosis antigens; Rickettsia
antigens; Chlamydia antigens; Coxiella antigens; malaria
sporozoite and merozoite proteins, such as the circumsporozoite
protein from Plasmodium berghei sporozoites; Clostridium
antigens; Leishmania antigens; Salmonella antigens; Mycobacterium
africanum antigens; Mycobacterium intracellulare antigens;
Mycobacterium avium antigens; E. coli antigens; Borrelia
antigens; Listeria antigens; Francisella antigens; Yersinia
antigens; Treponema antigens; Schistosoma antigens; Filaria
antigens; Pneumococcus antigens; Staphylococcus antigens; Herpes
virus antigens; influenza and parainfluenza virus antigens;
measles virus antigens; mumps virus antigens; hepatitis virus
antigens; Shigella antigens; Bordetella antigens; Hemophilus
antigens; Streptococcus antigens; polio virus antigens; Rift
Valley Fever virus antigens; dengue virus antigens; Human
Immunodeficiency Virus (HIV) antigens; and respiratory syncytial
virus (RSV) antigens.

In one embodiment, the at least one DNA sequence encodes at
least one protein or polypeptide or fragment or derivative
thereof which includes an epitope which is recognized by
cytotoxic T lymphocytes induced by an HIV protein or fragment or
derivative thereof. The at least one DNA sequence may encode an
HIV protein or fragment or derivative thereof. HIV proteins or
polypeptides which may be encoded by the at least one DNA
sequence include, but are not limited to, HIV-1-gp 120; HIV-1-gp
41; HIV-1-gp 160; HIV-1-pol; HIV-1-nef; HIV-1-tat; HIV-1-rev;
HIV-1-vif; HIV-1-vpr; HIV-1-vpu; HIV-1-gag; HIV-2-gp 120;
HIV-2-gp 160; HIV-2-gp 41; HIV-2-gag; HIV-2-pol; HIV-2-nef;
HIV-2-tat; HIV-2-rev; HIV-2-vif; HIV-2-vpr; HIV-2-vpu; and
HIV-2-vpx.

Mycobacteria which may be transformed with the at least one
DNA sequence, which encodes a protein or polypeptide or fragment
or derivative thereof which includes an epitope which is
recognized by cytotoxic T lymphocytes, include, but are not
limited to, Mycobacterium bovis-BCG, M. smegmatis, M. avium,
M. phlei, M. fortuitum, M. lufu, M. paratuberculosis, M. habana,
M. scrofulaceum, and M. intracellulare. In a preferred embodiment,
the mycobacterium is M. bovis-BCG or a mutant thereof.

The at least one DNA sequence may be contained within an 
expression vector, which is transformed into a mycobacterium, 
whereby the mycobacterium expresses the protein or polypeptide or 
fragment or derivative thereof which includes an epitope which is 
recognized by cytotoxic T lymphocytes.

The expression vector may be, for example, a temperate 
shuttle phasmid or a bacterial-mycobacterial shuttle plasmid. 
Each of these vectors may be used to introduce the at least one 
DNA sequence encoding a protein or polypeptide or fragment or 
derivative which includes an epitope which is recognized by 
cytotoxic T lymphocytes, stably into mycobacteria, in which the 
at least one DNA sequence may be expressed. When a shuttle
phasmid, which replicates as a plasmid in bacteria and a phage in
mycobacteria, is employed, integration of the phasmid, which
includes the at least one DNA sequence encoding a protein or
polypeptide, or fragment or derivative thereof, which includes an
epitope which is recognized by cytotoxic T lymphocytes, into the
mycobacterial chromosome occurs through site-specific
integration. The at least one DNA sequence which encodes a
protein or polypeptide or fragment or derivative thereof, which 
includes an epitope which is recognized by cytotoxic 
T lymphocytes, is replicated as part of the chromosomal DNA. When 
a bacterial-mycobacterial shuttle plasmid is employed, the at
least one DNA sequence which encodes a protein or polypeptide or
fragment or derivative thereof, which includes an epitope which
is recognized by cytotoxic T lymphocytes, is stably maintained
extrachromosomally in a plasmid. Expression of the at least one
DNA sequence occurs extrachromosomally (e.g., episomally). For
example, the at least one DNA sequence is cloned into a shuttle 
plasmid and the plasmid is introduced into a mycobacterium such 
as those hereinabove described, wherein the plasmid replicates 
episomally. Examples of such shuttle phasmids and 
bacterial-mycobacterial shuttle plasmids are further described in 
Application Serial No. 361,944, filed June 5, 1989, which is 
hereby incorporated by reference. 

In one embodiment the mycobacteria are transformed with an 
expression vector which comprises at least one DNA sequence
encoding a protein or polypeptide which includes an epitope which
is recognized by cytotoxic T lymphocytes, and a promoter selected
from the class consisting of mycobacterial promoters and 
mycobacteriophage promoters for controlling expression of the DNA 
encoding the heterologous protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes. 

Mycobacterial and mycobacteriophage promoters which may be
employed include, but are not limited to, mycobacterial promoters
such as the BCG HSP60 and HSP70 promoters; the mycobactin
promoter from M. tuberculosis and BCG; the mycobacterial 14 kDa
and 12 kDa antigen promoters; the mycobacterial α-antigen
promoter from M. tuberculosis or BCG; the MBP-70 promoter; the
mycobacterial 45 kDa antigen promoter from M. tuberculosis or BCG;
the superoxide dismutase promoter; the mycobacterial asd
promoter; and mycobacteriophage promoters such as the Bxb1, L1,
L5, and TM4 promoters. In one embodiment, the promoter is a
mycobacterial heat shock protein promoter such as HSP60 or HSP70.

The promoter sequence may, in one embodiment, be part of an 
expression cassette which also includes a portion of the gene 
normally under the control of the promoter. For example, when a
mycobacterial HSP60 or HSP70 promoter is employed, the expression
cassette may, within the scope of the present invention, include,
in addition to the promoter, a portion of the gene for the HSP60
or HSP70 protein. When the expression cassette and the DNA
encoding the protein or polypeptide, or fragment or derivative
thereof, which includes an epitope which is recognized by
cytotoxic T lymphocytes are expressed, the protein expressed by
the cassette and the DNA encoding the protein or polypeptide is a
fusion protein of a fragment of a mycobacterial protein (e.g., the
HSP60 or HSP70 protein), and the protein or polypeptide or
fragment or derivative thereof which includes an epitope which
is recognized by cytotoxic T lymphocytes.

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon, which 
provides for the initiation of the translation of mRNA, are each
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the protein or 
polypeptide or fragment or derivative thereof which includes an 
epitope which is recognized by cytotoxic T lymphocytes, and the 
transcription termination site, may be of mycobacterial origin, 
or of other bacterial origin, or such stop codon and 
transcription termination site may be those of the DNA encoding 
the protein or polypeptide which includes an epitope which is
recognized by cytotoxic T lymphocytes. 

Preferably, the mycobacterial promoter is a BCG promoter, 
and the mycobacterium is BCG.

In one embodiment, the expression vector may further include 
DNA which encodes for proteins or polypeptides such as, but not 
limited to, antigens, anti-tumor agents, enzymes, lymphokines, 
pharmacologic agents, immunopotentiators, reporter molecules of 
interest in a diagnostic context, and selectable markers. 

Selectable markers which may be encoded include, but are not
limited to, the β-galactosidase marker, the kanamycin resistance
marker, the chloramphenicol resistance marker, the neomycin
resistance marker, and the hygromycin resistance marker.

In accordance with one embodiment, the vector further 
includes a mycobacterial origin of replication. 

In accordance with another embodiment, the vector may be a 
plasmid. The plasmid may be a non-shuttle plasmid, or may be a 
shuttle plasmid which further includes a bacterial origin of 
replication such as an E. coli origin of replication, a Bacillus
origin of replication, a Staphylococcus origin of replication, a
Streptomyces origin of replication, or a pneumococcal origin of
replication. In one embodiment, the shuttle plasmid includes an
E. coli origin of replication.

In accordance with yet another embodiment, the vector may 
further include a multiple cloning site, and the DNA encoding for 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic 
T lymphocytes is inserted in the multiple cloning site. 

In addition to the DNA encoding a heterologous protein or
polypeptide, and the mycobacterial promoter for controlling
expression of the DNA encoding the protein or polypeptide which
includes an epitope which is recognized by cytotoxic
T lymphocytes, the expression vector may, in one embodiment,
further include a DNA sequence encoding bacteriophage integration
into a mycobacterium chromosome. Bacteriophages from which the
DNA sequence encoding bacteriophage integration into a
mycobacterium chromosome may be derived include, but are not
limited to, mycobacteriophages such as, but not limited to, the
L5, L1, Bxb1, and TM4 mycobacteriophages; the lambda phage of
E. coli; the toxin phages of Corynebacteria; phages of Actinomycetes
and Nocardia; the φC31 phage of Streptomyces; and the P22 phage
of Salmonella. Preferably, the DNA sequence encodes
mycobacteriophage integration into a mycobacterium chromosome.
The DNA sequence which encodes bacteriophage integration into a
mycobacterium chromosome may include DNA which encodes integrase,
which is a protein that provides for integration of the vector
into the mycobacterial chromosome. Preferably, the DNA sequence
encoding mycobacterial phage integration also includes DNA which
encodes an attP site.

The DNA encoding the attP site and the integrase provides 
for an integration event which is referred to as site-specific 
integration. DNA containing the attP site and the integrase gene 
is capable of integrating into a corresponding attB site of a 
mycobacterium chromosome. 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 

Examples of expression vectors which include mycobacterial
promoters and mycobacteriophage promoters for controlling
expression of the at least one DNA sequence encoding a protein or
polypeptide, or
fragment or derivative thereof, which includes an epitope which 
is recognized by cytotoxic T lymphocytes are further described in 
application Serial No. 642,017, filed January 16, 1991, which is 
a continuation of Application Serial No. 552,828, filed July 16, 
1990, now abandoned. The contents of Application Serial No. 
642,017 are hereby incorporated by reference. 

In another embodiment, the mycobacteria are transformed with
DNA which comprises a first DNA sequence which is a phage DNA
portion encoding bacteriophage integration into a mycobacterium
chromosome, and DNA including the at least one DNA sequence
encoding a protein or polypeptide, or fragment or derivative
thereof, which includes an epitope which is recognized by
cytotoxic T lymphocytes.

The term "phage DNA portion", as used herein, means that the 
DNA sequence is derived from a phage and lacks the DNA which is
required for phage replication. 

Bacteriophages from which the phage DNA portion may be 
derived include, but are not limited to, mycobacteriophages, such
as, but not limited to, those hereinabove described.
Preferably, the phage DNA portion encodes mycobacteriophage 
integration into a mycobacterium chromosome. 

In a preferred embodiment, the first DNA sequence includes 
DNA encoding integrase, which is a protein that provides for 
integration of the DNA into the mycobacterial chromosome. Most 
preferably, the first DNA sequence also includes DNA which 
encodes an attP site.

The DNA sequence encoding the attP site and the integrase
provides for an integration event which is referred to as
site-specific integration. DNA containing the attP site and the
integrase gene is capable of integration into a corresponding
attB site of a mycobacterium chromosome.

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 

The integration event results in the formation of two new
junction sites called attL and attR, each of which contains part
of each of attP and attB. The inserted and integrated non-phage
DNA which includes the first DNA sequence and the at least one
DNA sequence, which encodes a protein or polypeptide, or fragment
or derivative thereof, which includes an epitope which is
recognized by cytotoxic T lymphocytes is flanked by the attL and
attR sites. The insertion and integration of the phage DNA
portion results in the formation of a transformed mycobacterium.
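
As an illustrative aside, the formation of the two junction sites
can be pictured as a reciprocal exchange between the phage and
chromosomal attachment sites. The sketch below is a toy model
only. It assumes that each site consists of a left arm, a shared
core, and a right arm, and it follows the naming convention
commonly used for phage lambda (left junction = B-O-P', right
junction = P-O-B'); the specification does not state which
convention applies to mycobacteriophages. The names `AttSite`
and `integrate` are hypothetical, and the labels are placeholders
rather than real sequences.

```python
from typing import NamedTuple, Tuple


class AttSite(NamedTuple):
    """Toy attachment site: left arm, shared core, right arm."""
    left_arm: str
    core: str
    right_arm: str


def integrate(att_p: AttSite, att_b: AttSite) -> Tuple[AttSite, AttSite]:
    """Return (attL, attR) from a reciprocal crossover within the shared core.

    Assumption: the phage and bacterial sites share an identical core.
    """
    if att_p.core != att_b.core:
        raise ValueError("phage and bacterial sites must share a core in this model")
    # Each junction keeps one arm from the bacterial site and one from the phage site.
    att_l = AttSite(att_b.left_arm, att_b.core, att_p.right_arm)
    att_r = AttSite(att_p.left_arm, att_p.core, att_b.right_arm)
    return att_l, att_r


# Placeholder labels (hypothetical), not real DNA sequences.
att_p = AttSite("P", "O", "P'")
att_b = AttSite("B", "O", "B'")
att_l, att_r = integrate(att_p, att_b)
print("left junction: ", "-".join(att_l))
print("right junction:", "-".join(att_r))
```

In this model each junction contains one arm from the phage site
and one arm from the chromosomal site, which is the property the
paragraph above describes.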

The DNA may further include DNA which encodes a selectable
marker or markers; or other proteins or polypeptides of interest,
such as, but not limited to anti-tumor agents, enzymes, 
lymphokines, pharmacologic agents, immunopotentiators, and 
reporter molecules of interest in a diagnostic context. 

Selectable markers which may be encoded include, but are not 
limited to, the kanamycin resistance marker, the neomycin
resistance marker, the chloramphenicol resistance marker, and
the hygromycin resistance marker. 

The phage DNA portion of the present invention, which 
includes the first DNA sequence encoding mycobacterium phage 
integration into a mycobacterium chromosome, and the at least one 
DNA sequence encoding a protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope recognized by 
cytotoxic T lymphocytes, may be constructed through genetic 
engineering techniques known to those skilled in the art. In a 
preferred embodiment, the phage DNA portion may be a plasmid 
including, in addition to the DNA encoding integration and the 
DNA encoding a protein or polypeptide, or fragment or derivative 
thereof, which includes an epitope recognized by cytotoxic 
T lymphocytes, an origin of replication for any of a wide variety
of organisms, which includes, but is not limited to, E. coli,
Streptomyces species, Bacillus species, Staphylococcus species,
Shigella species, Salmonella species and various species of
pneumococci. Most preferably, the plasmid includes an origin of
replication for E. coli.

The phage DNA portion also may include a suitable promoter. 
Suitable promoters include, but are not limited to, mycobacterial 
promoters and mycobacteriophage promoters such as those 
hereinabove described. 

The promoter sequence may, in one embodiment, be part of an 
expression cassette which also includes a portion of the gene 
normally under the control of the promoter, as hereinabove 
described. For example, when a mycobacterial HSP60 or HSP70 
promoter is employed, the expression cassette may include, in 
addition to the promoter, a portion of the gene for the HSP60 or
HSP70 protein. When the expression cassette and the DNA encoding
the protein or polypeptide, or fragment or derivative thereof,
which includes an epitope which is recognized by cytotoxic T
lymphocytes are expressed, the protein expressed by the cassette
and the DNA encoding the protein or polypeptide is a fusion
protein of a fragment of a mycobacterial protein (e.g., the HSP60
or HSP70 protein), and the protein or polypeptide, or fragment or
derivative thereof, which includes an epitope which is recognized
by cytotoxic T lymphocytes. 

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon, which 
provides for the initiation of the translation of mRNA, are each 
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the protein or 
polypeptide, or fragment or derivative thereof, which includes an
epitope which is recognized by cytotoxic T lymphocytes, and the 
transcription termination site, may be of mycobacterial origin, 
or of other bacterial origin, or such stop codon and 
transcription termination site may be those of the DNA encoding 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic 
T lymphocytes. 

Examples of DNA which includes a first DNA sequence which is 
a phage DNA portion encoding bacteriophage integration into a 
mycobacterium chromosome, and DNA including the at least one DNA 
sequence encoding a protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes are further described in application 
Serial No. 553,907, filed July 16, 1990, the contents of which 
are hereby incorporated by reference. 

Mycobacteria which are transformed with DNA which encodes
a protein or polypeptide or fragment(s) or derivative(s)
thereof, which includes an epitope which is recognized by
cytotoxic T lymphocytes, may be employed in a composition, such
as a vaccine, for inducing a CTL response in an animal. The
vaccine may be administered to a human or non-human animal.

To form such a vaccine, the transformed mycobacteria are 
administered in conjunction with a suitable pharmaceutical 
carrier. As representative examples of suitable carriers there
may be mentioned: mineral oil, alum, synthetic polymers, etc.
Vehicles for vaccines are well known in the art and the selection 
of a suitable vehicle is deemed to be within the scope of those 
skilled in the art from the teachings contained herein. The 
selection of a suitable vehicle is also dependent upon the manner 
in which the vaccine is to be administered. The vaccine may be 
in the form of an injectable dose and may be administered 
intramuscularly, intravenously, orally, intradermally, or by 
subcutaneous administration. 

Other means for administering the vaccine or therapeutic 
agent should be apparent to those skilled in the art from the 
teachings herein; accordingly, the scope of the invention is not 
to be limited to a particular delivery form. 

When the transformed mycobacteria are employed as a vaccine, 
such a vaccine has important advantages over other presently 
available vaccines. Mycobacteria have, as hereinabove indicated, 
adjuvant properties among the best currently known and,
therefore, stimulate a recipient's immune system to respond with 
great effectiveness. This aspect of the vaccine induces 
cell-mediated immunity and thus is especially useful in providing 
immunity against pathogens in cases where cell-mediated immunity 
appears to be critical for resistance. Also, mycobacteria may 
stimulate long-term memory or immunity. It thus may be possible 
to prime long-lasting T cell memory, which stimulates secondary 
antibody responses neutralizing to the infectious agent. Such 
priming of T cell memory is useful, for example, against 
pertussis, malaria, influenza virus, Herpes virus, rabies, Rift
Valley fever virus, dengue virus, measles virus, Human 
Immunodeficiency Virus (HIV), and respiratory syncytial virus. 

The invention will now be described with respect to the
following examples; however, the scope of the present invention 
is not to be limited thereby. 

Example 1

A. Construction of plasmid including mycobacterial promoter
expression cassette and lacZ gene.
1. Construction of pYUB125

Plasmid pAL5000, a plasmid which contains an origin of
replication of M. fortuitum, and described in Labidi, et al.,
FEMS Microbiol. Lett., Vol. 30, pgs. 221-225 (1985) and in Gene,
Vol. 71, pgs. 315-321 (1988), is subjected to a partial Sau3A
digest, and 5 kb fragments are gel purified. A 5 kb fragment is
then ligated to BamHI digested pIJ666 (an E. coli vector
which contains an E. coli origin of replication and also carries
neomycin-kanamycin resistance, as described in Kieser, et al.,
Gene, Vol. 65, pgs. 83-91 (1988)) to form plasmid pYUB12. A
schematic of the formation of plasmid pYUB12 is shown in
Figure 1. pYUB12 and pIJ666 were then transformed into
M. smegmatis and BCG. Neomycin-resistant transformants were
obtained only by pYUB12 transformation, which confirmed that
pAL5000 conferred autonomous replication to pIJ666 in
M. smegmatis and BCG.

Shotgun mutagenesis by Snapper, et al. (1988, hereinabove
cited) indicated that no more than half of the pAL5000 plasmid
was necessary to support plasmid replication in BCG. This
segment presumably carried open reading frames ORF1 and ORF2,
identified by Rauzier, et al., Gene, Vol. 71, pgs. 315-321
(1988), and also presumably carried a mycobacterial origin of
replication. pYUB12 is then digested with HpaI and EcoRV, and a
2586 bp fragment carrying this region or segment of pAL5000 is
removed and ligated to PvuII digested pYUBS. Plasmid pYUBS (a
pBR322 derivative) includes an E. coli replicon and a kanR (aph)
gene. Ligation of the 2586 bp pYUB12 fragment to PvuII digested
pYUBS results in the formation of pYUB53, as depicted in Figure 2.
Transformation of pYUB53 confirmed that the EcoRV-HpaI fragment,
designated M.rep, was capable of supporting autonomous
replication in BCG.

Plasmid pYUB53 was then digested with AatI, EcoRV, and PstI
in order to remove the following restriction sites:

AatI 5707
EcoRI 5783
BamHI 5791
SalI 5797
PstI 5803
PstI 7252
SalI 7258
BamHI 7264
EcoRI 7273
ClaI 7298
HindIII 7304; and
EcoRV 7460

Fragment ends are then flushed with T4 DNA polymerase and
religated to form plasmid pYUB125, construction of which is shown 
in Figure 3. 

2. Elimination of superfluous vector DNA from pYUB125
792 bases of the tet gene, which had been inactivated by
prior manipulations, were eliminated by a complete NarI digest,
gel purification of the 6407 bp fragment,
ligation/recircularization, transformation of E. coli strain
HB101, and selection of KanR transformants. The construction of
the resulting plasmid, pMV101, is schematically indicated in
Figure 4, and the DNA sequence of pMV101, which includes markings
of regions which will be deleted, and of mutations, as
hereinafter described, is shown in Figure 5.

3. Construction of expression cassette based on BCG HSP60.
Among the most abundant proteins in mycobacteria is the
HSP60 heat shock protein (also known as the 65 kDa antigen).
Because abundance of the HSP60 protein in mycobacteria indicates
strong HSP60 gene expression, the sequence controlling HSP60
expression was chosen to control expression of heterologous genes
encoding antigens or other proteins in BCG.

The published sequence of the BCG HSP60 gene (Thole, et al.,
Infect. and Immun., Vol. 55, pgs. 1466-1475 (June 1987)), and
surrounding sequence permitted the construction of a cassette
carrying expression control sequences (i.e., promoter and
translation initiation sequences) by PCR. The BCG HSP61 cassette
(Figure 6) contains 375 bases 5' to the BCG HSP60 start codon,
and 15 bases (5 codons) 3' to the start codon. PCR
oligonucleotide primers were then synthesized. Primer Xba-HSP60,
of the following sequence:

GAG ATC TAG ACG GTG ACC ACA ACC CGC C

was synthesized for the 5' end of the cassette, and primer
Bam-HSP61, of the following sequence:

CTA GGG ATC CGC AAT TGT CTT GGC CAT TG

was synthesized for the 3' end of the cassette. The primers were
used to amplify the cassette by PCR from BCG strain Pasteur
chromosomal DNA. The addition of the BamHI site at the 3' end
of the cassette adds one codon (Asp) to the first six codons of
the HSP60 gene.
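
The primer design described above can be checked with a short
script. The following is a minimal sketch, not part of the
original procedure: it uses the two primer sequences exactly as
printed above, the standard recognition sequences for XbaI
(TCTAGA) and BamHI (GGATCC), and the standard genetic code. It
reverse-complements the 3' primer to recover the sense strand and
translates from the start codon. The helper names are
hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons of seq, reading from position 0."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Primer sequences as printed in the text (spaces removed).
XBA_HSP60 = "GAGATCTAGACGGTGACCACAACCCGCC"
BAM_HSP61 = "CTAGGGATCCGCAATTGTCTTGGCCATTG"

XBAI_SITE = "TCTAGA"   # standard XbaI recognition sequence
BAMHI_SITE = "GGATCC"  # standard BamHI recognition sequence

# The 3' primer anneals to the sense strand, so its reverse
# complement gives the coding sequence at the end of the cassette.
sense = reverse_complement(BAM_HSP61)
start = sense.find("ATG")
# Start codon + 5 further codons + the codon contributed by the BamHI site.
fusion_codons = sense[start:start + 21]

print(XBAI_SITE in XBA_HSP60)    # True
print(BAMHI_SITE in BAM_HSP61)   # True
print(translate(fusion_codons))  # MAKTIAD
```

Under these assumptions, the seventh codon (GAT, Asp) overlaps
the BamHI site, in agreement with the statement that the site
adds one Asp codon after the first six codons of the HSP60 gene.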

Each of pMV101 and the PCR cassette HSP61 was digested with
NheI and BamHI. The PCR cassette was then inserted between the
NheI and BamHI sites of pMV101, then ligated to form plasmid
pMV65A (Figure 7).

The E. coli lacZ gene (Figure 8) was used as a reporter, or
marker, gene to assay the ability of the HSP61 cassette to express
heterologous genes in BCG. A BamHI restriction fragment carrying
the lacZ gene was cloned into the BamHI site of BamHI digested
pMV65A, resulting in the formation of pMV65A/LZ as indicated
schematically in Figure 7. The formation of pMV65A/LZ results in
a fusion between the HSP60 and lacZ genes at the sixth codon of
the HSP60 gene and the sixth codon of the lacZ gene. pMV65A/LZ
was then transformed into E. coli. Blue E. coli colonies were
selected on X-gal plates for the presence of pMV65A/LZ, thus
indicating that the HSP60 promoter and translation initiation
sequences were also active in E. coli.

pMV65A/LZ was then transformed into BCG and plated on Dubos
Oleic Agar plates containing X-gal. All BCG colonies resulting
from this transformation exhibited blue color, thus indicating
that the lacZ gene product (β-galactosidase) was expressed in
BCG. SDS polyacrylamide gel electrophoresis was performed on
lysates of the pMV65A/LZ BCG recombinants, revealing that
β-galactosidase protein was expressed to levels in excess of 10%
of total BCG protein (as determined by staining with Coomassie
brilliant blue). These data indicated that the BCG HSP61
expression cassette was functional in expression vector pMV65A.

Example 2 

Cytotoxic T lymphocyte response to E. coli β-galactosidase.

β-galactosidase was expressed in BCG as a six amino
acid fusion protein with BCG hsp60 protein using
extrachromosomal plasmid vector pMV65A/LZ utilizing the HSP60
promoter to drive expression. The recombinant BCG was grown to
mid-log phase in Dubos media and concentrated by centrifugation.
The bacteria were then re-suspended in PBS plus 0.05% Tween 80
and cup sonicated briefly to disperse clumped bacteria. Six week
old BALB/c mice were inoculated with a single dose of 2 X 10*,
2 X 10 , or 2 X 10 colony forming units (CFUs determined
post-inoculation) by either intradermal (ID), intraperitoneal
(IP), or intravenous (IV) injection. At 14 or 19 weeks
post-immunization splenocytes were harvested from mice and CTL
activity was measured. CTL activity was measured as follows:

Splenocytes (ACK-treated, 5 X loVml) were stimulated in
vitro in 10 ml in upright T25 flasks by co-culture for 5 days
with mitomycin C-treated cells transfected with the lacZ gene
(C3-4 cells; 5 X loVml). A 4 hr. 51Cr release assay was then
performed in triplicate using P815 and P13.1 cells (P815 cells
transfected with the lacZ gene) as targets. Various
effector-target ratios were tested using 5,000 targets/well.
Specific lysis was calculated as follows: % specific lysis =
100 X [(release by effector cells minus spontaneous release)/
(maximal release minus spontaneous release)].
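
The specific lysis calculation above can be expressed as a short
function. This is a minimal sketch, assuming that release values
are supplied as raw counts and that triplicate wells are averaged
before the formula is applied; the function name and the example
counts are hypothetical and are not data from the experiments
described here.

```python
def percent_specific_lysis(experimental: float,
                           spontaneous: float,
                           maximal: float) -> float:
    """Return 100 * (experimental - spontaneous) / (maximal - spontaneous)."""
    window = maximal - spontaneous
    if window <= 0:
        raise ValueError("maximal release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / window


# Hypothetical triplicate counts for one effector-target ratio.
triplicate_counts = [1450.0, 1520.0, 1480.0]
mean_release = sum(triplicate_counts) / len(triplicate_counts)

# Hypothetical control wells: targets alone, and fully lysed targets.
spontaneous_release = 400.0
maximal_release = 2400.0

lysis = percent_specific_lysis(mean_release, spontaneous_release, maximal_release)
print(f"{lysis:.1f}% specific lysis")
```

Subtracting spontaneous release in both the numerator and the
denominator expresses lysis as a fraction of the releasable label
only, which is why the parentheses in the formula matter.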

At 19 weeks post-immunization, the remaining animals in each
group were boosted by intraperitoneal injection of 10 µg of
purified lacZ emulsified in incomplete Freund's adjuvant (IFA).
Splenocytes were then harvested from these animals at 23 weeks
and CTL activity was again measured. Unimmunized animals,
animals immunized with lacZ emulsified in IFA, or animals
immunized with vaccinia virus expressing lacZ served as
controls.

The results of the above experiments, as determined by %
specific lysis of target cells, indicated that a CTL response was 
induced in mice immunized with BCG transformed with the 
expression vector pMV65A/LZ. 

Example 3 

Construction of integrating plasmid including mycobacterial
promoter expression cassette and HIV-1-gp 120 gene.
1. Elimination of undesirable restriction sites in aph (kan)
gene.

To facilitate future manipulations, the HindIII and ClaI
restriction sites in the aph gene in plasmid pMV101 were
mutagenized simultaneously by polymerase chain reaction (PCR)
mutagenesis according to the procedure described in Gene, Vol. 77,
pgs. 57-59 (1989). The bases changed in the aph gene were at the
third position of codons (wobble bases) within each restriction
site and the base substitutions made were designed not to change
the amino acid sequence of the encoded protein.

Separate PGR reactions of plasmid pMVlOl with primer 
ClaMut-Kan'* HlndRMut-Kan and HindFMut-Kan * Bam-Kan were 
performed at 90*0 (1 mln.), 50**C (1 min.), and 72*'C (1 min. ) for 
25 cycle*. The PCR primers had the following base seqeuences: 

ClaMut-Kan 

CTT GTA TGG GAA GCC CC 3 
HlndRMut-Kan 

GTC ACA ATG GCA AAA GAT TAT GCA TTT CTT TCC AG f 
HindFMut-Kan 
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GTG TGG AAA GAA ATG CAT AAT CTT TTG CCA TTC TCA CCG G r 
Bam*Kan 

CGT AGA GGA TCC AGA GGA CG ^ 
The resulting PCR products were gel purified and mixed, and a 
single PCR reaction without primers was performed at 94°C (1 
min.), 72°C (1 min.) for 10 cycles. Primers ClaMut-Kan and 
Bam-Kan were added and PCR was resumed at 94°C (1 min.), 50°C ( 
min.), and 72°C (2 min.) for 20 cycles. The resulting PCR 
product (Kan. mut) was digested with BamHI and gel purified. 
Plasmid pMV101 was digested with ClaI and cohesive ends were 
filled in by Klenow + dCTP + dGTP. Klenow was heat inactivated 
and the digest was further digested with BamHI. The 5232 base 
pair fragment was gel purified and mixed with fragment Kan. mut 
and ligated. The ligation was transformed into E. coli strain 
HB101 and Kan^R colonies were screened for plasmids resistant to 
ClaI and HindIII digestion. Such plasmids were designated as 
pMV110, which is depicted in Figure 4. 
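
The primerless PCR step above relies on the two first-round 
products sharing a complementary region at the mutagenized HindIII 
site. As a rough check, and assuming the primer sequences are 
exactly as printed in this text, the following sketch finds the 
stretch shared by HindRMut-Kan and the reverse complement of 
HindFMut-Kan. The helper names are illustrative only. 

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def longest_common_substring(a, b):
    """Brute-force longest substring of `a` that also occurs in `b`."""
    best = ""
    for i in range(len(a)):
        # Only test candidates longer than the current best.
        for j in range(i + len(best) + 1, len(a) + 1):
            if a[i:j] in b:
                best = a[i:j]
            else:
                break
    return best


# Primer sequences copied from the text, spaces removed.
HIND_R = "GTCACAATGGCAAAAGATTATGCATTTCTTTCCAG"
HIND_F = "GTGTGGAAAGAAATGCATAATCTTTTGCCATTCTCACCGG"

overlap = longest_common_substring(HIND_R, reverse_complement(HIND_F))
print(len(overlap), overlap)
```

A shared stretch of this kind is what lets the two gel-purified 
products anneal and prime each other before the outer primers 
ClaMut-Kan and Bam-Kan are added. 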

2. Elimination of sequences not necessary for plasmid 
replication in mycobacteria. 

Plasmid pMV110 was resected in separate constructions to 
yield plasmids pMV111 and pMV112. In one construction, pMV110 
was digested with NarI and BalI, the ends were filled in, and a 
5296 base pair fragment was ligated and recircularized to form 
pMV111. In another construct, pMV110 was digested with NdeI and 
SplI, the ends were filled in, and a 5763 base pair fragment was 
ligated and recircularized to form pMV112. Schematics of the 
constructions of pMV111 and pMV112 are shown in Figure 9. These 
constructions further eliminated superfluous E. coli vector 
sequences derived from pAL5000 not necessary for mycobacterial 
replication. Cloning was performed in E. coli. Plasmids pMV111 
and pMV112 were tested for the ability to replicate in M. 
smegmatis. Because both plasmids replicated in M. smegmatis, the 
deletions of each plasmid were combined to construct pMV113 
(Figure 9). 

To construct pMV113, pMV111 was digested with BamHI and 
EcoRI, and a 1071 bp fragment was isolated. pMV112 was digested 
with BamHI and EcoRI, and a 3570 bp fragment was isolated, and 
then ligated to the 1071 bp fragment obtained from pMV111 to form 
pMV113. These constructions thus defined the region of pAL5000 
necessary for autonomous replication in mycobacteria as no larger 
than 1910 base pairs. 

3. Mutagenesis of restriction sites in mycobacterial 
replicon. 

To facilitate further manipulations of the mycobacterial 
replicon, PCR mutagenesis was performed as above to eliminate the 
SalI, EcoRI, and BglII sites located in the open reading frame 
known as ORF1 of pAL5000. PCR mutagenesis was performed at 
wobble bases within each restriction site and the base 
substitutions were designed not to change the amino acid sequence 
of the putative encoded ORF1 protein. The restriction sites were 
eliminated one at a time for testing in mycobacteria. It was 
possible to eliminate the SalI and EcoRI sites without altering 
replication in M. smegmatis. In one construction PCR mutagenesis 
was performed at EcoRI1071 of pMV113 with primers Eco Mut - M.rep 
and Bam-M.rep to form pMV117, which lacks the EcoRI1071 site. 
Primer Eco Mut - M.rep has the following sequence: 

TCC GTG CAA CGA GTG TCC CGG A; 

and Bam-M.rep has the following sequence: 

CAC CCG TCC TGT GGA TCC TCT AC. 

In another construction, PCR mutagenesis was performed at 
the SalI 1389 site with primers Sal Mut - M.rep and Bam-M.rep to 
form pMV119, which lacks the SalI 1389 site. Primer Sal Mut - 
M.rep has the following sequence: 

TGG CGA CCG CAG TTA CTC AGG CCT. 

pMV117 was then digested with ApaLI and BglII, and a 3360 bp 
fragment was isolated. pMV119 was digested with ApaLI and BglII, 
and a 1281 bp fragment was isolated and ligated to the 3360 bp 
fragment isolated from pMV117 to form pMV123. A schematic of the 
constructions of plasmids pMV117, pMV119, and pMV123 is shown in 
Figure 10. Elimination of the BglII site, however, either by PCR 
mutagenesis or Klenow fill in, eliminated plasmid replication in 
mycobacteria, thus suggesting that the BglII site is in proximity 
to, or within, a sequence necessary for mycobacterial plasmid 
replication. 

4. Construction of pMV200 series vectors. 

To facilitate manipulations of all the components necessary 
for plasmid replication in E. coli and mycobacteria (E. rep. and 
M. rep.) and selection of recombinants (Kan^R), cassettes of each 
component were constructed for simplified assembly in future 
vectors and to include a multiple cloning site (MCS) containing 
unique restriction sites and transcription and translation 
terminators. The cassettes were constructed to allow directional 
cloning and assembly into a plasmid where all transcription is 
unidirectional. 

Kan Cassette 

A DNA cassette containing the aph (Kan^R) gene was 
constructed by PCR with primers Kan5' and Kan3'. An SpeI site 
was added to the 5' end of the PCR primer Kan3', resulting in the 
formation of a PCR primer having the following sequence: 

CTC GAC TAG TGA GGT CTG CCT CGT GAA G. 

BamHI + NheI sites were added to the 5' end of the primer 
Kan5', resulting in the formation of a PCR primer having the 
following sequence: 

CAG AGG ATC CTT AGC TAG CCA CT GAC CTC GGG G. 

PCR was performed at bases 3375 and 4585 of pMV123, and 
BamHI and NheI sites were added at base 3159, and an SpeI site 
was added at base 4585. Digestion with BamHI and SpeI, followed 
by purification, resulted in a 1228/2443 Kan cassette bounded by 
BamHI and SpeI cohesive ends with the direction of transcription 
for the aph gene proceeding from BamHI to SpeI. 

E. rep. cassette 

A DNA cassette containing the ColE1 replicon of pUC19 was 
constructed by PCR with primers E.rep/Spe and E.rep/Mlu. An SpeI 
site was added to the 5' end of PCR primer E.rep/Spe and an MluI 
site was added to the 5' end of PCR primer E.rep/Mlu. The 
resulting primers had the following sequences: 

E.rep/Spe 

GCA CTA GTT GCA CTG AGC GTG AGA GCC 

E.rep/Mlu 

GAG AAC GCG TTG CGC TCG GTG GTT CGG GTG. 

PCR was performed at bases 713 and 1500 of pUC19, and an 
MluI site was added to base 713, and an SpeI site was added to 
base 1500. Digestion with MluI and SpeI, followed by 
purification, resulted in an E.rep. cassette bounded by SpeI and 
MluI cohesive ends with the direction of transcription for RNA I 
and RNA II replication primers proceeding from SpeI to MluI. 

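
As a quick illustration of the sites engineered into the cassette 
primers, the sketch below scans three of the primers for the 
standard recognition sequences of BamHI, NheI, SpeI and MluI. It 
assumes the primer sequences are exactly as printed in this text; 
the dictionary and function names are illustrative. 

```python
# Standard recognition sequences of the enzymes used for cassette assembly.
SITES = {
    "BamHI": "GGATCC",
    "NheI": "GCTAGC",
    "SpeI": "ACTAGT",
    "MluI": "ACGCGT",
}

# Primers copied from the text, spaces removed.
PRIMERS = {
    "Kan3'": "CTCGACTAGTGAGGTCTGCCTCGTGAAG",
    "E.rep/Spe": "GCACTAGTTGCACTGAGCGTGAGAGCC",
    "E.rep/Mlu": "GAGAACGCGTTGCGCTCGGTGGTTCGGGTG",
}


def find_sites(seq):
    """Map enzyme name to the 0-based position of its first site in `seq`."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


for name, seq in PRIMERS.items():
    print(name, find_sites(seq))
```

In each case the added site sits near the 5' end of the primer, 
as the text describes. 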
M.rep. cassette 

A DNA cassette containing sequences necessary for plasmid 
replication in mycobacteria was constructed by PCR of pMV123 with 
primers M.rep/Mlu and M.rep/Bam. An MluI site was added to the 
5' end of PCR primer M.rep/Mlu. A BamHI site was added to the 5' 
end of PCR primer M.rep/Bam. The resulting PCR primers had the 
following base sequences: 

M.rep/Mlu 

GCA TAG GCG TGA GCC CAG GAG CTG CG 

M.rep/Bam 

CAC CCG TCC TCT GGA TCG TCT AC 

PCR was performed at bases 134 and 2082 of pMV123. An MluI 
site was added to base 2082. Digestion with BamHI and MluI, 
followed by gel purification, resulted in a 1935 base pair DNA 
cassette bounded by MluI and BamHI cohesive ends with the 
direction of transcription for the pAL5000 ORF1 and ORF2 genes 
proceeding from MluI to BamHI. 

The Kan^R, E.rep, and M.rep PCR cassettes were then mixed in 
equimolar concentrations and ligated, and then transformed into E. 
coli strain HB101 for selection of Kan^R transformants. Colonies 
were screened for the presence of plasmids carrying all three 
cassettes after digestion with BamHI + MluI + SpeI and designated 
pMV200. An additional restriction site, NcoI, was eliminated 
from the M.rep cassette by digestion of pMV200 with NcoI, fill in 
with Klenow, and ligation and recircularization, resulting in the 
formation of pMV201. A schematic of the formation of pMV200 from 
pMV123 and pUC19, and of pMV201 from pMV200, is shown in Figure 
11. Plasmids pMV200 and pMV201 were transformed into M. 
smegmatis and BCG. Both plasmids yielded Kan^R transformants, 
thus indicating their ability to replicate in mycobacteria. 

A synthetic multiple cloning sequence (MCS) (Figure 12) was 
then designed and synthesized to facilitate versatile molecular 
cloning and manipulations for foreign gene expressions in 
mycobacteria, and for integration into the mycobacterial 
chromosome. The synthetic MCS, shown in Figure 12, contains 16 
restriction sites unique to pMV201 and includes a region carrying 
translation stop codons in each of three reading frames, and a 
transcription terminator derived from E. coli 5S ribosomal RNA 
(T1). 

To insert the MCS cassette, pMV201 was digested with NarI 
and NheI, and the resulting fragment was gel purified. The MCS 
was digested with HinPI and NheI, and the resulting fragment was 
gel purified. The two fragments were then ligated to yield 
pMV204. A schematic of the construction of pMV204 is shown in 
Figure 13. 

Plasmid pMV204 was then further manipulated to facilitate 
removal of the M.rep cassette in further constructions. pMV204 
was digested with MluI, and an MluI - NotI linker was inserted 
into the MluI site between the M.rep and the E.rep to generate 
pMV206. A schematic of the construction of pMV206 from pMV204 is 
shown in Figure 14, and the DNA sequence of pMV206 is given in 
Figure 15. 

5. Construction of expression cassette based on BCG HSP60. 

The HSP61 cassette (Figure 6) was constructed as hereinabove 
described in Example 1. 

Each of pMV206 and the PCR cassette HSP61 was digested with 
XbaI and BamHI. The PCR cassette was then inserted between the 
XbaI and BamHI sites of pMV206, then ligated to form plasmid 
pMV261. The construction of this plasmid is shown schematically 
in Figure 17. The reading frame and the restriction sites of the 
multiple cloning site of pMV261 are shown in Figure 16. 

The E. coli lac Z gene was used as a reporter, or marker 
gene, to assay the ability of the HSP61 cassette to express 
heterologous genes in BCG. A BamHI restriction fragment carrying 
the lac Z gene was cloned into the BamHI site of BamHI digested 
pMV261, resulting in the formation of pMV261/LZ. A schematic of 
the construction of pMV261/LZ is shown in Figure 18. The 
formation of pMV261/LZ results in a fusion between the HSP60 and 
lac Z genes at the sixth codon of the HSP60 gene and the sixth 
codon of the lac Z gene. pMV261/LZ was then transformed into E. 
coli. Blue E. coli colonies were selected on X-gal plates for 
the presence of pMV261/LZ, thus indicating that the HSP60 
promoter and translation initiation sequences were also active in 
E. coli. 

pMV261/LZ was then transformed into BCG and plated on Dubos 
Oleic Agar plates containing X-gal. All BCG colonies resulting 
from this transformation exhibited blue color, thus indicating 
that the lac Z gene product (β-galactosidase) was expressed in 
BCG. SDS polyacrylamide gel electrophoresis was performed on 
lysates of the pMV261/LZ BCG recombinants, revealing that 
β-galactosidase protein was expressed to levels in excess of 10% 
of total BCG protein (as determined by staining with Coomassie 
brilliant blue). These data indicated that the BCG HSP61 
expression cassette was functional in expression vector pMV261. 

Plasmid pMV261/LZ was then shown to replicate autonomously, 
and express the E. coli β-galactosidase, or lac Z gene, driven by 
the BCG promoter HSP60, in M. smegmatis and BCG. 

6. Transfer of mycobacteriophage L5 integration sequences 
to BCG expression vector. 

Plasmid pMH9.4, which includes the mycobacteriophage L5 attP 
site, and the L5 integrase gene, was employed in providing the L5 
integration sequences to a BCG expression vector. The 
construction of pMH9.4, as well as its integration into M. 
smegmatis and BCG, is described below in sections (i) through 
(vi). 

(i) Identification of the DNA sequences of the attachment sites, 
attB, attL, and attR, of M. smegmatis. 

Using standard technologies, a lambda EMBL3 library was 
constructed using chromosomal DNA prepared from mc²61 (a strain 
of M. smegmatis which includes an M. smegmatis chromosome into 
which has been integrated the genome of mycobacterial phage L5) 
and digested with Bam HI. Phage L5 contains DNA having 
restriction sites identical to those of phage L1 (Snapper, et al. 
1988), except that L5 is able to replicate at 42°C and phage L1 
is incapable of such growth. This library was then probed with a 
6.7 kb DNA fragment isolated from the L5 genome that had been 
previously identified as carrying the attP sequence (Snapper, et 
al. 1988). One of the positive clones was plaque purified, DNA 
was prepared, and a 1.1 kb Sal I fragment (containing the attL 
sequence) was sub-cloned into sequencing vector pUC119. The DNA 
sequence of this fragment was determined using a shotgun approach 
coupled with Sanger sequencing. By isolating and sequencing the 
attL junction site and comparing this to the DNA sequence of L5 
that was available, a region was determined where the two 
sequences aligned but with a specific discontinuity present. The 
discontinuity represents one side of a core sequence, which is 
identical in attP, attB, and attL. The region containing the 
recombinational crossover point is shown in Figure 19. 

The attL DNA (1.1 kb Sal I fragment) was used as a probe to 
hybridize to a Southern blot of Bam HI digested mc²6 DNA; mc²6 
is a strain of M. smegmatis which includes an M. smegmatis 
chromosome without any phage integration (Jacobs, et al., 1987, 
hereinabove cited). A single band of approximately 6.4 kb was 
detected corresponding to the attB sequence of M. smegmatis. 
This same attL probe was used to screen a cosmid library of mc²6 
(provided by Dr. Bill Jacobs of the Albert Einstein College of 
Medicine of Yeshiva University), and a number of positive cosmid 
clones were identified. DNA was prepared from these clones, and 
a 1.9 kb Sal I fragment (containing the attB site) that 
hybridizes to the attL probe was subcloned into pUC119 for 
sequencing and further analysis. The DNA sequence containing the 
core sequence was determined and is shown in Figure 19. The core 
sequence, which is identical in attP, attB and attL, has a length 
of 43 bp. 
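
The idea of a core sequence shared by attP, attB and attL can be 
sketched as a longest-common-substring search. The sequences below 
are short, made-up stand-ins (not the L5 or M. smegmatis 
sequences of Figure 19), with attL modelled as the bacterial left 
arm joined to the phage right arm through the core. 

```python
def longest_common_substring(seqs):
    """Brute-force longest substring present in every sequence in `seqs`."""
    first = min(seqs, key=len)
    best = ""
    for i in range(len(first)):
        # Only test candidates longer than the current best.
        for j in range(i + len(best) + 1, len(first) + 1):
            candidate = first[i:j]
            if all(candidate in s for s in seqs):
                best = candidate
            else:
                break
    return best


# Hypothetical toy sequences, for illustration only.
CORE = "GGTCTCGTTGG"
ATT_P = "ACACAC" + CORE + "TGTGTG"   # phage arms flank the core
ATT_B = "CTCTCT" + CORE + "AGAGAG"   # bacterial arms flank the core
ATT_L = "CTCTCT" + CORE + "TGTGTG"   # bacterial left arm, phage right arm

shared = longest_common_substring([ATT_P, ATT_B, ATT_L])
print(shared, len(shared))
```

With real data the same comparison would be run on the sequenced 
attL junction and the available L5 sequence, where the alignment 
breaks at one edge of the core. 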

The mc²61 lambda EMBL3 library was then probed with the 
1.9 kb SalI fragment containing the attB site. Positive plaques 
were identified, DNA was prepared, and analyzed by restriction 
analysis and Southern blots. Lambda clones were identified that 
contained a 3.2 kb Bam HI fragment containing the putative attR 
site. The 3.2 kb Bam HI fragment was purified and cloned into 
pUC119 for sequencing and further analysis. 

(ii) Determination of attP-integrase region of L5 genome. 

Concurrent with the above procedures, a significant 
portion of the DNA sequence of L5 had been determined and 
represented in several "contigs" or islands of DNA sequence. 
Sequences of the 6.7kb Bam HI fragment hereinabove described were 
determined by (a) analysis of the location of Bam HI sites in the 
contigs of the DNA of L5, and (b) by determining a short stretch 
of DNA sequence from around the Bam HI sites of plasmid pJR-1 
(Figure 24), which carries the 6.7kb Bam HI fragment of L5. 

A segment of DNA sequence was located that represented the 
6.7kb Bam HI fragment of phage L5. Studies of other phages have 
shown that the integrase genes are often located close to the 
attP site. It was thus determined that the L5 integrase (int) 
gene should lie either within the 6.7kb Bam HI fragment or in a 
DNA sequence on either side of it. The DNA sequence in the 
regions was then analyzed by translating it into all six possible 
reading frames and searching these amino acid sequences for 
similarity to the family of integrase related proteins, and 
through computer-assisted analysis of the DNA sequence. As shown 
in Figure 20, there are two domains of reasonably good 
conservation among L5 integrase and other integrases, and three 
amino acid residues that are absolutely conserved in domain 2. 
(See Yagil, et al., J. Mol. Biol., Vol. 207, pgs. 695-717 (1989), 
and Poyart-Salmeron, et al., EMBO J., Vol. 8, pgs. 2425-2433 
(1989)). A region was identified, and analysis of the 
corresponding DNA sequence showed a reading frame that could 
encode a protein of approximately 333 amino acids. These 
observations identified the putative int gene. 
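
Translation into all six reading frames, as used above to search 
for the int gene, can be sketched as follows. This is a generic 
illustration using the standard genetic code, not the software 
used in the original analysis; the 12-base input is a made-up 
example. 

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def six_frame_translations(dna):
    """Translate both strands in all three offsets; '*' marks a stop codon."""
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
            frames[f"{strand}{offset + 1}"] = "".join(CODON_TABLE[c] for c in codons)
    return frames


frames = six_frame_translations("ATGGCCAAATAA")   # hypothetical input
for name, protein in frames.items():
    print(name, protein)
```

Each translated frame could then be compared against conserved 
integrase residues, which is the kind of similarity search the 
text describes. 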

The location of the int gene was not within the 6.7kb Bam HI 
fragment; however, it was very close to it, with one of the Bam HI 
sites (that defines the 6.7kb Bam HI fragment) less than 100 bp 
upstream of the start of the gene. Analysis of the Bam HI sites 
showed that the int gene lay within a 1.9kb Bam HI fragment 
located adjacent to the 6.7kb Bam HI fragment. This 1.9kb Bam HI 
fragment was cloned by purification of the fragment from a Bam HI 
digest of L5 DNA and cloning into pUC 119, to generate pMH1 
(Figure 25). 

From a combination of the above approaches, a schematic of 
the organization of the attP-int region of L5 was constructed 
(Figure 26), and the gene sequence of the attP-int region is 
given in Figure 22. 

(iii) Construction of pMH5. 

The 6.7kb Bam HI fragment of mycobacteriophage L5, which 
contains the attP site, as hereinabove described, was cloned into 
the Bam HI site of pUC 119 (Figure 23). This was achieved by 
purifying the 6.7kb Bam HI fragment from a Bam HI digest of L5 
DNA separated by agarose gel electrophoresis and ligating with 
Bam HI cut pUC 119. DNA was prepared from candidate recombinants 
and characterized by restriction enzyme analysis and gel 
electrophoresis. A recombinant was identified that contained the 
6.7kb Bam HI fragment of L5 cloned into pUC 119. This plasmid 
was named pJR-1, as shown in Figure 24. 

Analysis of DNA sequence data from a project to sequence L5 
showed that a 1.9kb Bam HI fragment adjacent to the 6.7kb Bam HI 
fragment hereinabove described contained the integrase gene. 

A plasmid containing a 1.9kb Bam HI fragment containing the 
DNA encoding for the integrase cloned into the Bam HI site of pUC 
119 was constructed. The 1.9kb fragment was purified from a Bam 
HI digest of L5 DNA and cloned into the Bam HI site of pUC 119. 
Construction of the recombinant was determined by restriction 
analysis and gel electrophoresis. This plasmid was called pMH1, 
the construction of which is shown schematically in Figure 25. 

pJR-1 was then modified by digestion with EcoRI and SnaBI 
(both are unique cloning sites), between which is a Bam HI site. 
The EcoRI-SnaBI fragment, including the Bam HI site, was excised, 
and the plasmid was religated to form plasmid pMH2, which 
contains one Bam HI site compared to two Bam HI sites contained in 
pJR-1. A schematic of the construction of pMH2 is shown in 
Figure 26. 

The 1.9kb Bam HI fragment, which includes the integrase 
gene, was purified from a Bam HI digest of pMH1 and ligated to 
Bam HI digested pMH2. Recombinants were identified as above and 
the orientation of the 1.9kb fragment determined. A plasmid 
called pMH4 was thus constructed (Figure 27) in which the region 
from the SnaBI site (upstream of attP) through to the Bam HI 
site (downstream of the integrase gene) was identical to that in 
L5. 

pMH4 was digested with HindIII (unique site) and was ligated 
to a 1kb HindIII fragment purified from pKD43 (supplied by Keith 
Derbyshire of the Nigel Grindley Laboratory) that contains the 
gene determining resistance to kanamycin. Recombinants were 
identified and characterized as above. This plasmid is called 
pMH5. A schematic of the construction of pMH5 is shown in Figure 
28. 

(iv) Integration of pMH5 into attB of M. smegmatis. 

Plasmids pYUB12 (a gift from Dr. Bill Jacobs, a schematic of 
the formation of which is shown in Figure 1), pMD01 (Figure 29), 
and pMH5 were electroporated, with four different concentrations 
of plasmid DNA over a 1,000-fold range, into M. smegmatis strain 
mc²155, a strain which is able to support plasmid replication. 
In sections (iv) through (vi), all electroporation procedures of 
M. smegmatis, or of BCG, were carried out as follows: 

Cultures of organism were grown in Middlebrook 7H9 media, as 
described by Snapper, et al. (1988), harvested by centrifugation, 
washed three times with cold 10% glycerol, and resuspended at 
approximately a 100 x concentration of cells. 

1 ul of DNA was added to 100 ul of cells in an ice-cold 
cuvette and given a single pulse at 1.25 kV at 25 uF in a Bio-Rad 
Gene Pulser. 1 ml of broth was added and the cells were 
incubated for 1 hr. at 37°C for expression of the antibiotic- 
resistant marker. Cells were then concentrated and plated out on 
Middlebrook or tryptic soy media containing 15 ug/ml kanamycin. 
Colonies were observed after 3 to 5 days incubation at 37°C. 

Each of pYUB12, pMD01, and pMH5 carries kanamycin 
resistance. Plasmid pYUB12 carries an origin of DNA replication, 
while pMD01 lacks a mycobacterial origin of replication. Plasmid 
pMH5 does not carry a mycobacterial origin of replication, but 
carries a 2kb region of phage L5 which contains the attP site and 
the integrase gene (Figure 22). The number of transformants was 
linear with DNA concentration. Plasmid pYUB12 gives a large 
number of transformants (2 x 10^5 per ug DNA) in mc²155, while 
pMH5 gives 6 x 10* transformants per ug DNA, and pMD01 gives no 
transformants. 

The above experiment was then repeated by electroporating 
the plasmids pYUB12, pMD01, and pMH5 into M. smegmatis strain 
mc²6, which does not support plasmid replication. No 
transformants in mc²6 were obtained from pYUB12 or pMD01, while 
pMH5 gave approximately 10^4 kanamycin resistant transformants in 
mc²6 per ug of DNA, thus indicating integration of pMH5 into the 
mc²6 chromosome. 

DNA from six independent pMH5 transformants (four in mc²155 
and two in mc²6) was prepared. These DNAs (along with DNA from 
both mc²155 itself, and mc²155 carrying the plasmid pYUB12) were 
digested with a restriction enzyme, and analyzed by Southern blot 
and hybridization with the M. smegmatis 1.9kb attB probe 
hereinabove described. As shown in Figure 30, all six 
transformants have integrated into the attB site, resulting in 
the production of two new DNA fragments with different 
mobilities. If pMH5 did not integrate into the attB site, it 
would be expected that a single band, corresponding to the attB 
site in the mc²155 control, would be obtained. 

(v) Construction of pMH9.2 and pMH9.4 

pUC119 was digested with HindIII, and a 1kb HindIII 
fragment, containing a kanamycin resistance gene, purified from 
pKD43, was ligated to the HindIII digested pUC119 to form pMH8 
(Figure 31). A 2kb SalI fragment (bp 3226-5310), which carries 
the attP and integrase gene from SalI digested pMH5, was purified 
and inserted in both orientations relative to the vector backbone 
of SalI digested pMH8 to form plasmids pMH9.2 and pMH9.4 (Figures 
32 and 33). 

M. smegmatis strain mc²155 cells carrying, as a result of 
electroporation, plasmid pYUB12, pMH9.2 or pMH9.4, or strain mc²6 
cells carrying plasmid pMH5, as a result of electroporation as 
hereinabove described, were grown to saturation in broth with 
kanamycin. Cultures were then diluted 1:100 into broth without 
kanamycin and grown to saturation. Two further cycles of 
dilution and growth were done, corresponding to about 20 
generations of bacterial growth. Cultures were plated out to 
single colonies on non-selective plates, and approximately 100 of 
these colonies were patch plated onto both non-selective and 
selective plates. The % of colonies that were sensitive to 
kanamycin, thus corresponding to the percentage of cells which 
lost the plasmid, is given below in Table I. 

Table I 

Plasmid (strain)        % loss 

pYUB12 (mc²155)         35 
pMH5 (mc²6)             17 
pMH9.2 (mc²155)          3 
pMH9.4 (mc²155)          0 

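The arithmetic behind the stability test can be sketched as 
follows, assuming each 1:100 dilution regrown to saturation 
amounts to log2(100) doublings and that three such cycles were 
run in total (the first dilution plus two further cycles). The 
function names and the colony counts in the usage lines are 
illustrative. 

```python
import math


def generations(dilution_factor, cycles):
    """Doublings needed to regrow `cycles` successive dilutions to saturation."""
    return cycles * math.log2(dilution_factor)


def percent_loss(sensitive_colonies, colonies_tested):
    """Percentage of patched colonies that have become kanamycin sensitive."""
    if colonies_tested <= 0:
        raise ValueError("colonies_tested must be positive")
    return 100.0 * sensitive_colonies / colonies_tested


print(round(generations(100, 3), 1))   # three 1:100 cycles
print(percent_loss(35, 100))           # hypothetical: 35 of 100 patches sensitive
```

Three 1:100 cycles come to just under 20 doublings, consistent 
with the figure of about 20 generations given above. 
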
(vi) Transformation of BCG with pMH9.4 

The 1.9 kb Sal I fragment, which includes the M. smegmatis 
attB site as hereinabove described, was cloned into pUC119, and 
the plasmid generated was named pMH-12 (Figure 34). 

Gel purified Sal I 1.9kb M. smegmatis fragment containing 
attB (isolated from pMH-12) was used to probe a Southern transfer 
of Bam HI digested mycobacterial DNAs, including BCG substrain 
Pasteur, shown in Figure 35. This demonstrated that there is one 
Bam HI fragment of BCG that strongly hybridizes to the M. 
smegmatis attB probe and three that hybridize weakly. The 
strongest hybridizing band is the fastest moving band 
(approximately 1.9 kb). 

The same probe as above was used to probe a BCG cosmid 
library (provided by Dr. Bill Jacobs) and positive clones were 
identified. DNA was prepared from several positive clones and 
analyzed by restriction analysis and Southern blotting. The 1.9 
kb Bam HI fragment (corresponding to the strongly hybridizing 
band in the Southern blot) was identified, gel purified from the 
cosmid DNA and cloned into pUC119. The resulting plasmid was 
named pMH-15 (Figure 36). 

Plasmids pMH-5 and pMH9.4 were electroporated into BCG 
Pasteur. It was observed that pMH9.4 transforms BCG with high 
efficiency (approximately 10 transformants/ug DNA), while pMH-5 
transforms BCG at low efficiency (1-10 transformants/ug DNA). 
DNA was prepared from BCG transformants and analyzed by Bam HI 
restriction and Southern blot analysis, probing with gel purified 
1.9kb Bam HI BCG attB fragment from pMH-15. These data are shown 
in Figure 37 and show that integration of both pMH5 and pMH9.4 is 
specific to the BCG attB site (i.e., the strongly cross-hybridizing 
fragment in BCG). This is illustrated by the loss of the 1.9kb 
Bam HI fragment from the transformants and the appearance of two 
new bands representing attL and attR junction fragments. Figure 
37 shows just one of the pMH5/BCG transformants, although all of 
the four that were analyzed show that one of the bands (the 
largest) is smaller than expected (and different in each of the 
transformants), indicating that the transformation efficiency of 
pMH-5 is low in BCG. In contrast, the four pMH9.4 transformants 
are identical to each other (Figure 37) and give attR and attL 
junction fragments of the predicted sizes. 

Plasmid pMH9.4, which includes the mycobacterial phage L5 
attP site and the L5 integrase gene, was digested to completion 
with either KpnI + PvuII or XbaI + PvuII, and a restriction 
fragment of 1862 or 1847 base pairs, respectively, each of which 
contains the attP site and the integrase gene, was purified by 
agarose gel electrophoresis. Plasmid pMV261/LZ was digested with 
XbaI or DraI to generate either a 7569 bp or 7574 bp vector 
fragment. The 7569 bp fragment was ligated to the 1862 bp 
fragment derived from pMH9.4 to form pMV460F/LZ. The 7574 bp 
fragment was ligated to the 1847 bp fragment derived from pMH9.4 
to form pMV460R/LZ. Plasmids pMV460F/LZ and pMV460R/LZ each 
include a mycobacterial replicon, the L5 attP site, and the L5 
integrase gene. A schematic of the formation of plasmids 
pMV460F/LZ and pMV460R/LZ is shown in Figure 38. To generate 
derivatives without the mycobacterial plasmid replicon, plasmids 
pMV460F/LZ and pMV460R/LZ were digested with NotI and 
recircularized by ligation to generate pMV360F/LZ and pMV360R/LZ. 



A schematic of the construction of pMV360F/LZ and pMV360R/LZ is 
shown in Figure 39. 

Plasmids pMH9.4, pMV261/LZ, pMV360F/LZ, pMV360R/LZ, pMV460F/LZ, 
and pMV460R/LZ were then transformed into M. smegmatis and BCG to 
test their ability to replicate autonomously or integrate into 
the M. smegmatis or the BCG chromosome. Transformation with 
pMH9.4, pMV261/LZ, pMV360F/LZ, and pMV360R/LZ yielded kanamycin 
resistant transformants of M. smegmatis and BCG. Transformants 
of pMV261/LZ, pMV360F/LZ, and pMV360R/LZ were shown to express E. 
coli β-galactosidase by SDS-polyacrylamide gel electrophoresis 
and X-gal assay. Plasmids pMV460F/LZ and pMV460R/LZ failed to 
yield kanamycin resistant transformants, thus indicating that 
chromosomal integration of a plasmid carrying sequences mediating 
autonomous replication is lethal to mycobacteria. 

7. Construction of pMV307. 

Plasmid pMV206 was digested with NotI to remove the 
mycobacterial replicon. The resulting 2209 bp fragment, which 
includes the aph (Kan^R) gene, the E. coli replicon and the 
multiple cloning site, was ligated and recircularized to form 
pMV205, the construction of which is schematically depicted in 
Figure 14. 

PCR with primers XbaI-Att/Int and NheI-Att/Int was then 
performed on a Sal I fragment from pMH9.4, which contains the 
attP site and the L5 integrase gene. The resulting cassette was 
then digested with XbaI and NheI and a 1789 bp fragment was gel 
purified. pMV205 was then digested with NheI, and the resulting 
fragment was ligated to the 1989 bp fragment obtained from pMH9.4 
to form pMV307. A schematic of the construction of pMV307 is 
shown in Figure 40. 

8. Construction of pMV261/HIV1-gp 120. 

An SmaI-ClaI antigen gene fragment, or cassette, was 
constructed by PCR, and cloned between the Bam HI and ClaI 
restriction sites of pMV261 to form pMV261/HIV1-gp 120. 
Plasmid pMV261/HIV1-gp 120 was transformed into BCG, and the 
presence of the corresponding antigen in BCG was verified by the 
appearance of immunoreactive protein bands in Western blot 
analysis of BCG recombinant lysates. 

9. Construction of pMV361/HIV1-gp 120 

The HIV1-gp 120 antigen gene expression cassette, which 
includes a promoter sequence and an HIV1-gp 120 gene sequence, 
was excised from the pMV261 derivatives with NotI and a second 
restriction enzyme site (Pvu II, Eco RI, Sal I, Cla I or Hind 
III) and cloned into the integrating plasmid pMV307 between the 
NotI site and a second enzyme site (Pvu II, Eco RI, Sal I, Cla I 
or Hind III) to form the plasmid pMV361/HIV1-gp 120. The backbone 
of this plasmid is shown in Figure 41. 

Plasmid pMV361/HIV1-gp 120 was transformed into BCG and shown 
to express the corresponding antigens by Western blot analysis 
(Figure 42) with the appropriate antigen-specific human sera. 

Example 4 

Cytotoxic T lymphocyte responses to HIV-1 gp 120 

HIV-1 gp 120 was expressed in BCG as a six amino acid fusion 
protein with BCG hsp 60 protein using vector pMV361/HIV1-gp 120, 
using the hsp 60 promoter to control expression. 

Two groups of mice were inoculated with 1 X 10^ CFU's of 
recombinant BCG expressing the gp 120 gene from the integrative 
plasmid pMV361/HIV1-gp 120. One group received the BCG via 
intraperitoneal injection (100 ul) whereas the other group 
received the BCG by deposition of the dose (10 ul) rubbed into a 
tail scratch. 

CTL activity was measured at various times after 
immunization. CTL activity was measured as follows: 

Two mice from each group were sacrificed at various times 
after immunization, and the spleens were removed. Single cell 
suspensions were made and the red blood cells were lysed with 
ammonium chloride. The cells were stimulated in vitro for 5 days 
with P815 cells that were pulsed with peptide P18, a fifteen 
residue synthetic peptide within HIV1-gp 120. A 4-hour chromium 
(51Cr) release assay was then carried out using untreated P815 
and peptide P18 pulsed P815 cells as targets. Significant 
P18-specific CTL activity was observed in the mice immunized by 
tail scratch 14 weeks after immunization. At 16 weeks, CTL 
activity was observed in both groups of mice. Upon repeat of 
this experiment, CTL activity was observed at time points as 
early as 8 weeks after immunization. 

Example 5 

Recombinant BCG transformed with pMV361/HIV-1 gp120 were 
grown to mid-log phase in Dubos media and concentrated by 
centrifugation. The bacteria were then resuspended in 15% 
glycerol and frozen using a rate-controlled freezing apparatus. 
The bacteria were stored at -70°C until use (referred to as 
"vaccine"). A second preparation grown in the same way was not 
frozen and is referred to as a "fresh" preparation. Prior to 
immunization of animals, the bacteria were resuspended in PBS + 
0.05% Tween 80 to the desired concentration and cup sonicated 
briefly to disperse clumped bacteria. Six week old BALB/c mice 
were inoculated with a single dose of 5 x 10^4 cfu fresh bacteria 
(determined post inoculation) or 1.5 x 10^ frozen bacteria 
(determined pre-inoculation) by tail scratch (t.s.) injection. 
At 8 weeks post-immunization, splenocytes were harvested from 
animals and CTL activity was measured (described below). 
Splenocytes from unimmunized animals were used as controls in the 
CTL assays. 

CTL activity was determined as follows: 

Splenocytes (ACK-treated, 5 x 10^/ml) were stimulated in 
vitro in 10 ml in upright T25 flasks by co-culture for 5 days 
with mitomycin C-treated P815 cells (5 x 10^/ml) that were pulsed 
with 250 µg/ml of peptide P18 for one hour. A 4 hr 51Cr release 
assay was subsequently performed in triplicate using P815 targets 
with or without pulsing for 1 hour with 250 µg/ml peptide P18. 
Various effector-target ratios were tested using 5000 
targets/well. Specific lysis was calculated as follows: % 
specific lysis = 100 x [(release by effector cells minus 
spontaneous release)/(maximal release minus spontaneous release)]. 
The results are given in Figure 43. 
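
The specific lysis calculation described above can be sketched in a few lines of Python. This is an illustrative sketch only, not part of the described procedure: the function name and the counts are hypothetical, and averaging the triplicate wells before applying the formula is an assumption about how the triplicates were combined.

```python
from statistics import mean


def percent_specific_lysis(experimental, spontaneous, maximal):
    """Return % specific lysis from replicate 51Cr release counts.

    experimental: counts released from targets incubated with effector cells
    spontaneous:  counts released from targets incubated alone
    maximal:      counts released from fully lysed targets
    Replicate wells are averaged first (an assumption of this sketch).
    """
    exp = mean(experimental)
    spon = mean(spontaneous)
    maxi = mean(maximal)
    if maxi <= spon:
        raise ValueError("maximal release must exceed spontaneous release")
    # 100 x (effector release - spontaneous) / (maximal - spontaneous)
    return 100.0 * (exp - spon) / (maxi - spon)


# Hypothetical counts for illustration only; these are not data from the examples.
lysis = percent_specific_lysis(
    experimental=[1200, 1250, 1150],
    spontaneous=[400, 420, 380],
    maximal=[2400, 2380, 2420],
)
print(f"{lysis:.1f}% specific lysis")  # prints "40.0% specific lysis"
```

Subtracting spontaneous release in both numerator and denominator expresses lysis as a fraction of the label that effector cells could have released, so values from different target preparations are comparable.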

As shown in Figure 43, both groups of mice showed an 
increased CTL response at 8 weeks after immunization as compared 
with unimmunized mice. 

Example 6 

Recombinant BCG transformed with pMV361/HIV-1 gp120 were 
grown to mid-log phase in Dubos media and concentrated by 
centrifugation. The bacteria were then resuspended in 15% 
glycerol and frozen using a rate-controlled freezing apparatus. 
The bacteria were stored at -70°C until use (referred to as 
"vaccine"). A second preparation grown in the same way was not 
frozen and is referred to as a "fresh" preparation. Prior to 
immunization of animals, the bacteria were resuspended in PBS + 
0.05% Tween 80 to the desired concentration and cup sonicated 
briefly to disperse clumped bacteria. Six week old BALB/c mice 
were inoculated with a single dose of 5 x 10^4 cfu fresh bacteria 
(determined post inoculation) or 1.5 x 10^ frozen bacteria 
(determined pre-inoculation) by tail scratch (t.s.) injection. 
At 8 weeks post-immunization, splenocytes were harvested from 
animals and CTL activity was measured (described below). 

CTL activity was determined as follows: 

Lymph node cells (5 x 10^/ml) were stimulated in vitro in 10 
ml in upright T25 flasks by co-culture for 5 days with mitomycin 
C-treated P815 cells (5 x 10^/ml) that were pulsed with 250 µg/ml 
of peptide P18 for one hour. A 4 hr 51Cr release assay was 
subsequently performed in triplicate using P815 (matched) or EL4 
(mismatched) targets with or without pulsing for 1 hour with 250 
µg/ml peptide P18. Various effector-target ratios were tested 
using 5000 targets/well. Specific lysis was calculated as 
follows: % specific lysis = 100 x [(release by effector cells 
minus spontaneous release)/(maximal release minus spontaneous 
release)]. 
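
As a rough illustration of how a series of effector-target ratios translates into cell numbers at 5000 targets/well, the following sketch computes effector cells per well. The ratios shown are hypothetical, since the text does not state which ratios were tested, and the function name is likewise an assumption made for this example.

```python
TARGETS_PER_WELL = 5000  # stated in the assay description


def effectors_per_well(ratios, targets_per_well=TARGETS_PER_WELL):
    """Map each effector:target ratio to the effector cells needed per well."""
    return {ratio: ratio * targets_per_well for ratio in ratios}


# Hypothetical ratios for illustration; the tested ratios are not given in the text.
plan = effectors_per_well([100, 50, 25])
for ratio, cells in plan.items():
    print(f"E:T {ratio}:1 -> {cells} effector cells per well")
```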

The results of this assay are given in Figure 44. 

As shown in Figure 44, a CTL response to HIV-1 gp120 using 
lymph node cells was demonstrated following immunization of mice 
with BCG transformed with pMV361/HIV-1 gp120. 

It is to be understood, however, that the scope of the 
present invention is not to be limited to the specific 
embodiments described above. The invention may be practiced 
other than as particularly described and still be within the 
scope of the accompanying claims. 

WHAT IS CLAIMED IS: 

1. A method of inducing a CTL response in an animal 
comprising: 

administering to an animal mycobacteria transformed 
with at least one DNA sequence which encodes a protein or peptide 
or fragment or derivative thereof which includes an epitope which 
is recognized by cytotoxic T lymphocytes, said mycobacteria being 
administered in an amount effective to induce a CTL response in 
an animal. 

2. The method of Claim 1 wherein said protein or peptide 
or fragment or derivative thereof includes an epitope which is 
recognized by cytotoxic T lymphocytes induced by an HIV protein 
or fragment or derivative thereof. 

3. The method of Claim 2 wherein said protein or peptide 
or fragment or derivative thereof is an HIV protein or fragment 
or derivative thereof. 

4. The method of Claim 1 wherein the mycobacteria are of 
the species M. bovis BCG. 

5. A composition for inducing a CTL response in an animal, 
comprising: 

mycobacteria transformed with at least one DNA sequence 
which encodes a protein or peptide or fragment or derivative 
thereof which includes an epitope which is recognized by 
cytotoxic T lymphocytes, and an acceptable pharmaceutical 
carrier, said mycobacteria being present in an amount effective 
to induce a CTL response in an animal. 

6. The composition of Claim 5 wherein said protein or 
peptide or fragment or derivative thereof includes an epitope 
which is recognized by cytotoxic T lymphocytes induced by an HIV 
protein or fragment or derivative thereof. 

7. The composition of Claim 6 wherein said protein or 
peptide or fragment or derivative thereof is an HIV protein or 
fragment or derivative thereof. 

8. The composition of Claim 5 wherein said mycobacteria 
are of the species M. bovis BCG. 

[Figure: annotated nucleotide sequence with translated reading frames. The sequence text is not legible in the source; legible annotations include "M. rep" and "PCR mutagenesis".] 

[Figure: restriction map of the BamHI lacZ cassette, 3084 base pairs. Legible sites: BamHI, 824 ClaI, 1112 EcoRV, 3005 EcoRI, 3079 BamHI.] 

SUBSTITUTE SHEET 

WO 92/21376 

PCT/US92/04538 



Fl O. 9 



14) 

wne I 6407 Nor I- 



•Agt II 4915 



AlwNI 4453 



Nda I 3864 

•Spl I 3217 
Dlqe$t withNorl and Boll, 
Fill 'n, Ligofe/Recircularize 




Mac 



1307 Bal I 



I 



. l46BomH I 

•Nhel5285 164 Nor I- 




845 Ncol- 
1071 EcoR I 



1309 Bgl II 
1389 Sal ) 

Att II 4271 



AlwN I 3808 
2096 Spl I- 



1967 Nco I- 

193 EeoR I- 
2431 Bgl II- 
2511 Son- 



Digest withNdel and Spl I, 
Fill in, Ligate /Recirculo r ize 



Kih. I Kya-i BamH I- 
•Nhe I 5763 ie4Nar l• 
,423Mac|. 



•Nd« I 2742 



Digest with BamHIQEcoRI 
and Isolate 1071 bp fragment 




307 Bal 



1967 Nco I- 

'2193 EeoR I 
2431 Bgl II 
2511 Sol I 



Ligate -^Digest with BamHI a EcoRI 
jl and Isolate 3570 bp fragment 



Nhe I 4641 W6 BamH I 
wne I 464! 184 Nor I 



Aat II 3149 

-AlwN I 2867 




845 Nco I- 
1071 EcoR |. 

1309 Bgl II- 
1389 So I I- 
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ID 



•Nhe I 4641 



l46BamH I- 
184 Nor I 



Aat II 3149 



•AlwNI 2667 
•ApaL I 2590 



PCR mutagenesis 
ofEcoRI 1071 




Fl O. lO 



845Ncol' 
1071 EcoR I 

1309 Bgl II 
1369 Sal I 



■Nhe I 4641 



146 BamH I 
184 Nar i 




PCR mutagenesis 
of Sail 1369 



146 BamH I 
164 Nor I; 

Nhe I 4841 



645Ncol 



1309 Bgl II 
1369 Sal I- 

•Aot II 3149 



AlwN I 2687 
•ApaL I 2590 



Digest with ApaLIt Bglll. 
Isolate 3360 bp fragment 




-AlwN I 2687 
•ApaL I 2590 




Digest with ApaLI-t- Bglll. 
Isolate 1261 bp fragment 



•Nhe I 464! 



l46 6omH I 
164 Nor I- 



Aat II 3149 



AlwN I 2687 
■ApoL I 2590 




645 Nco I 



1309 Bgl II- 
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Bel I 4101 
^•Sfll 4084 
Spl 14002 
•Hpol 3993 
:Sol I 3984 
•Clal 3976 
EcoR I 3966 
•PstI 3960 
„ Pvull 3957 
•BamH I 3952 
•Nca I 3948 
•Xbo I 3938 

;Kpn I 3922 
•Not I 3914. 



Fl 



■7 

O 



207 Nru I- 




•Bel i 4 
Spl I 4016 
•Hpa I 4007 
^Sall 3998 
^.ClQ I 3992 
•HinD III 3986 
•EeoR I 3979 
•P«t I 3974 
•Pvu II 3971 
•BamH I 3966 
•Nca i 3962 
•Xba I 3953 
•Dra I 3947 
•Bgl II 3942 
•Kpn I 3936 
•Not I 3928 



15 



I Nhel 



^ -Bel I 2044 20/1 Mlul. ^ 
■Sp I 2105 

c°.'.lS?® Dioest Mlul Bgl li 2797 
• Cla l 208®/ Mlul-NotI Linker 

•HinD III 2075 
EcoR I 2068 
•P»t I 2063 
•Pvu II 2060 
•BamH 1 2055 
•Ncol 2051 
•Xbo I 2042 
•Oral 2036 
eglll2031 



I pM' 

II M.rep 

\l\ 4120 b 



Kan""^ 



Digest Not I 
Ligate/Recircularize 



Kpnl 2025 
Wotl20l7 
•Mlul 2011 



INtiel 



pMV206 
base poirs 



E.rep 



2011 Mlu I- 
2017 Notl- 
2025 Mlu i- 




-Spe I 1219 
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TXXXnXXTCXXj%GTGGCXi\TAA(7ItX7IX;iCITACCXXX7nXX; GAC 

1501- - + ...-- + ♦ + + ' 

ACCGACGACXKrrcACOGCTATlt:AGCACAGAATGGCai\ACCTCA<^^ 

CACACAGaX^GCnXXMCaJAACGACCTACACCGAACrGAGATACCTACA 

IfiOl + + ♦ + + 

GTXnXTTOGGGTOCAACCTCGCTItKntXIATGTGGC^ 
AGCTATtXXXTTAAGCCGCACGCIXrGAACAGGAGAGCGCACXIAGGfiACCT 

^''^^TtiATAriiiATiTxxxxnccrAr.c^^ - 

TCTGACITGAGCCrrCGA Il 11 ll^llf ATCCTCGTCAGGOGGGCGGAGOCTA 

1801 + + + + + * „ 

AGACTGAACItXXllACCTAAAAACACTACGAGCAGTCaXCCGCClXXKSA y_ 

CCCTmXXTITLlCATGTirrrTCXnXXXJTTATCCCC^ 

1901 ♦ + * + + - ^ 

CGGAAAAOGAGTC7rACAAGAAAGGACGCAATAGGGr.ACTAAGACACCTA17 

AC6GAGOGCAACGCGTCXXXXXXXl\OCX:(nx;AGCCCACCAGCrCCGTAAt^^ J 
2D01 + + + + + - 1- 

TGGcnxcc7rnxxxjiCGa:GGa7itxxx:ACTctx^ < 

A0GGGTCTAAGGaXKXnX7rA0CXX:0GCCAC\GCGGCTCTCAG0G^ ^ 

2101- •»• + + + ♦ 

TCXrCXLiGATltXXiCXXXIACATGCCGGCCGriTTrCGCC^^ 

■ TCGGGGTGCTTXXXnXTraXnXXTIXTITCCACCACCACCGC^ 

2»1 ♦ ♦ ♦ ♦ ♦ 

... ACCOCCACGACCCGApiGCX;ACCACAAr/7Tr,CTG<7IXX:CG 

TGGAGCTCGTCTItXXIACCATACj^CCtXnTIATTAATCGTKGrr^ 
2301 + ♦ + + + . 

ACxnx:GAGCACACCxnxx7rATcrnxxrcAcrAATrAccA(:cAGATCATCX7ri 

GCCGCTGGCAAGCGACGATrirx:it?nAf;CXX;AIitTA(XX-,<:CAAAGC(TX'C; 

2401 + + + + + - 

(XXKXJACCCTn^CXnXXnVVGAACGAGCTCCtXTTAGATCGO^CTntX'^^ 



d 

in 
d 



A^VCClXXritXriXXTIXXIACGTAGACCATXXAGA^^ 

2501 + + + + + 

lTGGA(XA(XAGCACXnXX::ATOTX7rACGrr^ 
CCAAOCGCCAOaiACACGCAGTGTXXXXIACrc 

2601 + ♦ + + 

GGlTGCCCGTCCGTGTCXXTrcACACCaTrcAGTTGC^^ 

2701 + ^ + + + 

TCCGGAAGCCCCGCGGCVGCTACCGCIXXXrCTCAATGAGTCCGGAGTAC 

CTCTACACACTTj^GOL^CATCGAGGaXlAGCI^^ _ 

2801 + + ^ + + |i 

GAGATGTX7rcAGTCGGTGTAGCTCXGGCTCGAG(XGaX^^ ^ 

GCCGGAATTGCGCACTGTrcATTCCXT^ 

290] + + + + + - 

CCGCCTrAACGaTTCACAAGCrAAGGCACIXXAAOVrc 

CGCGATCTATTXrGAGTGCCA('r^^GA.\ArGrrGAAlI'ir:C^^ 

+ + * ^ 

GccxTFAGATACGGCTxiACcx; IX ,cTx:(K:rrix^/ 

AGCATrrcGCGTTCGATCACAACCAAGTCGCr^ 

3101 + + + + 

TCGTAAACCCCAACCrAGTGTGGTTO^GCGCGTAAACCC^^ 
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X ^7 

GATAGTrAOCGGATAAGCX:GCAC<X<7irXXXXnGAACGGCGGG'riU;ii; 

+ + + ♦ + !«» 

CTATCAATGGCCTATimXXriaxrAGCaXliCTnXXXmiAA 
(XrCTCAGCATTCACAAACCCOCACCCTIXXX^C/UCCGAGAAACGaXUC 

+ + + + + 17D0 

TtXXIACrcGTAACTCTnCGCGGTCCGAAGGGCllUX'ICI'll 
IXXAGGCGGAAACGOCnXn-ATCrrrATACTOCntTiaXXJI^^ 

X + ♦ + + + 1800 

- GGrrcCCCCrrrCXrCGACCATAGAAATATCACXJACAGCOCAAAGOGCnx;^ 

TGGAAAAACCCCAGCAA<XXXKX:CTTTTTACX7nXXlXXXXTITI^^ 

i£ ♦ + + + + 190O 

L_ CCTTTTrCKXGTaJTTGCGCCXX;AAAAATGOCAAGGA(XGGAAAACGAC 

I CCCTATTACCGCCTTCA(7rcACCTCATACCCCIXXXXXX:ACCCGAACG 

t + + + + + 2D0O 

^ GGCATAAlXXXXX^AAACrcACTtXJACTATXXXXIACCCGCGTXXXXTrcC 

I 

o 

h 7xxxxxxxnxritrrc«nxx7rAca:cax:ATitAccccGCACGGCX7r^^ 

+ + + + + 2100 

AGCCCXXTMACACACClClAGCATXXCaXCnAAGlCCCX^ 

GAAACCTCCTCGAAACGACGCAlXTItTITCClCCltXnTXXTrACACGT^^ 

+ + + + + 2200 

CrriXXrAGGAGClTItXnCCGTACACAAGGACCACCAACCATCnXXAO^ 

Acoc<v3r^(7it7rocA(rnt7rcccG^ 

+ + + + + 2100 

TXr,<r(XCTG\CACGTCAAC:ACCCCAaXGGGAGTCXXn^ 

CCGTGAGCXllCCTtXXrGACGAATnXUGCAGCnnWX^^ 

+ -- •- -- -- - + -- -- -- -- - + -•--«•.•. + .... .....^ 

(Q tXXL\CIXXXnXXj\GaXXnXXnTAAACTXX7IXXACACCG^ 

a7iTxxx:ccrAGcxxxxaxrrACATca\GGOGAACCCM 



< 



(5 



0 



arAr^c/xjATCccccccccATGTAOciraxnTwxnTtnt^^ 

AGCGCCCCGGGGTXXXllTCCCCTCXXXAACGCGATCCTGGGCA-lT^^ 

+ + + + + + 2300 

TCXKXGGOCCtrArXXn'AGGCCACCCCTKXXCTAGO^OCCCCT 
^(XAATAaxrCCCGOCTAACCXXXntXXZATACATWXXXXXTIXXXXXXIA 

+ + + + + 2300 

TTXXTITATr^rAXarGCATrCCGCGAGCCTATCTrACCX^ 

I, COV\AAACCrCGGrCACATCGCCIXXX;AAACXXiUTXXXnXXl\CIXL\aV^ 

+ + + + + 2800 

ItTl-rmtXXKXrGGTXTTAGOGGACanTIXXrCTrACXXAGGTCAG^ 
GCCGCGCIXXX:GTCACCACACCAOGTACAAAGaXXnxrGA(XCaXT^ 

+ + + ♦ + 2900 

aXX:Ga;A(XGCAGTCGTCTCX7ItXL4TGlTlCGCCCACCCT^ 

ccctcatx:cgcatctacctcxxx:acccggaacgtccaccgactcggccg 

gggactacgcctagatggacggctgcgccitgcaccixxxixxivgccgcc 
accnxtixnxxcggacccctacccgacagcgaggtocgogcci^^ 

+ + + + + + 3100 

lGCArj\CAGGGCCirXXX;ATCCOCTCTCXXna::AGGCGCGCTAG^ 

CGTr,GTCTA(XAGCCCACACTC:AGTCXXKXXXL\GT^^ 

+ + ♦ + 3QD0 

CACCACATG<nXXXXnGTCAGTCACCaKXX7n:AGCCGGTAGAGOGCCT 
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^1 



GTClGGAlXXnXJTACCmrjiCGACCAGCACCTn^ 

mi + + + + 

CAGAOCTAOCAGATGCACXTIXXnXXntXnt^^ 
dDPNDVDVVLLNALATP 
ATCXXXnXX^AGCAGATtXTKXXnTGC^ 

2801 + ^ + + 

TAGGGGAGCTOGlCTAGCAGCGAACGGritXXXXX^^ 

d GELLDDSALPWYPLW 

TAATCACCGGTCTATCCrrOCXLiCACC^ 

290h + + + 

d ATIACItXXXIACATACCACGCItTITKlXXACG^ 

TVPTNDSVLE LDSI ES 

TGATGAAACA(XACXXLiCAGCa:;AGCA(X^ 

3001 » » ♦ + 

ACTACri'lt/lljGlCtX;! U'ltXXXntXntKXXXTITXXnXXiACATCGTrG 
dS S VG AVA SCGWGG TG 

CTGTCXXXXXXXrrACACGCOCXXTTAGAOCXX 

3101- + ♦ + 

CAGAiXGCOCKXIATGTtKXXKXXSAATCTGC^^ 



CrCACCrOGCTTTATGGCGTACGAATCGCCIXnU^ CD 

mh + + + + n 

CACnUGACTGAAATACXXXIATGCTrAGCroACAC^ *n 

OGAGGOCGCAOXXXX^CXXXXXTIXTIX^ 

- + ^ * + O 

GCTCXXKKXTKKrCGCCOGCGCCACAGTIXrriT^^ 2. 

OCGCaXUTGAGaXXXCrTTACOCTXXX^^ U- 

34or + + + 

GGaXKXnAClXXGCGOGGAATOCGA(XGAC^ X 

AGGCintlXXXnTITTAAGGCTCAATITGCTIt^^ h- 

3501- + + + -I > 

TCCGGAGCGGGAAAAATTCXXiACTrAAACCAACAGACGC^ ^ 

ACACATGAOCAACTTXKATAACGTIXnt^^ j- 

360h + ♦ + + + U 

TCTX7rACIGGTrcAAGCTAITXK:AAGAGC^ ^ 

ACACCmnxntTTAAGOGGATGaiXXJAGCAGAC^ < 

VO] + + ♦ + + ^ U 

TKnCCAACKCACATTCXXXJTACGCCCCrCGC^ ^ 



GCGATAGCCGACTTTTATACTCGCTrAACTATGCGGCAT^ 

3801 + ♦ + + + - 

CGCTATCGCXTrcACATATGACCGAATrcAlACGCCGTAC^ 

AGAAAATACOGCATCAGGOCXnXnTCaxnTI^^ 

3901- + + + + 

TXTITrATCX;OGTACT€CCCXUGAACC0C;AAOT^ 

TAATA(XGTrATCCACAGAAlTj^GaXL\TAA(^^ 

4001 + + + ^ + 

ATTATXKXLiATAGGTCTCTrAGTCCCCTATTT^^ 

GirrntXATAGGCTCxxxxxrccxrrACCA^ 

4101 ^ ^ + + ♦ 

CAAAAAGGTATCCGAGCCXXXXXXJACIXXTT^ 

cxxocTGGAAGcrocxnx:cTxxxx:TX7r^^ 

43D1 + + + + + 

GGOXJACCITOJAGGGAGCACGCCAGAGGACAAGGCIXXXiAOGGOT 
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- + * ^ * ^ *2800 

MAEIYRRGLASARSQRYI 

- + . + ♦ 4 + 2900 

rnXACGAGTITAAGCAGCOCXlTTXLiOCX^ 

LQE FED AVMSVS PLLC RN 

"TGAGCGGCCACCOCAOiACTQCACAC^^ 

' * " * ^ ^ + + + 3000 

4cnxaxGrimxntnTGACCTGTtMG^ 

LPWGVVACEG ARGD LGS 
CAGGAGGAACACAlXXiritX;ilItXUGGACX;illtX^ 

' • • * - • • • + - - * - + 3100 

GTCClIXTItnCTACGCAGCAAAGCn^^ 
V L L F V 

CTGAATCaXXXKTTACGAGOCACACAGCACaDC^ 

,^---.--.--4.-----. + ^3nn 
< GGACTTACXXXXXrATXXnXXKTIXnxntXri^^ 
^ ^ rep - MIb 



X 



QD 



"0 



0 



GAGCTTUGATAOGOCCrACTX^CCCTtXX:^ 

O + + + -I- 4- + 3300 

CIXXACTCTATGCGCXUTGAGTGCGACOC^ 

AOCAGCAGGIGriXXUGGCTIXXXnXXAAGTCX^C^ 



aXXTTCCTXXIAC^AGCrLXXJAACCGAGCTrcACITIXXT^ 



5 *- + + — ■ + + +3SD0 

CAACXAOtJnXCCCAGCItXXXLi^ 

I CrTXTIXXj^CGGTGTATCTAOCXTrTAGTC^^ 

^ + ^ + 4- +36D0 

^ CGAACAGCrrcOIACATAGATXXKAATCAGCTTrc^ 

5 rcCGrrcATGACGCrrcAAAACCTCTCACACAT^ 

^ ♦ + + 3?D0 

AGCCACTACTGaL\CriTrOCAGu\CIX7I^^ 

CTTCAGCCGCTGTIXXXXXXnxntXKXXXiG^ 
ACTCGCOilACAACCGCCCACACCCCCGCa 

ACTGAGAGTGCACCATATGCGGTXTItUAATAO^^ 

+ ♦ + -f +3900 

TGACTCTCACGTtXTrATACCCCACACTIT^ 

+ + + + -^4000 

OCXlACCXJiCaUiGCCACATTATnJAOCAJCGlt^ 

_ CAAAAGaXACCAAAAGACCAGGAACCCTAAAAAGOOC^^ 

- - - + - - - - ♦ + +4100 

GTTITCCGGTCXriTITCCCGTtXnT^ 

AAGTCAGAGGTCXXXAAACaX^CAGGACrATAAAGATAC^ 

+ + + + + + 42D0 

CGGATACCTGrixrcccirnntx^ 

— + - + + + + 4300 

[XCTATCGACAGOCGGAAAGAGGGAAGCanTtI»VCa 
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s5 I HI 

330) * + ♦ + * ' 

TCCaXXTTOGTCXXXXXritXXXXXGCTOCrnTIXL^ 

0GGCTACAGCGACGGClACAACaXX>CCa;ACIX7ICCGCAAAAACCCGCG 

3301- •• ♦ ♦ + + • * ♦ * 

GCOGATGTCGCTCXXXMTGrnXXXXXnXXXXrreACACGOGT^^ 

3«1 ♦ ♦ + + + 

CAGCAGGCOGAGCACCGCUlLLIUXXTITr/ntZACrGAGCGGCICC^ 

GGCCXXIAAAaXXXAAACATniXXXXnXXIATtnXXJACACCGrrAAGa;^ ^ 

3SD1 + •♦■ "*■ + ..- - - - - + ^ 

CCGGOGTTKXDCGGTITCTAAAGaXXIACCTAGACCroi^^ 

AAAGGax:ACAA0GAAGCaUCAATCX:A0CGCK7nX^^ 

3601 + + + + r 

TnxrCGtntTITGCTItXXKnXTITAtXnWXXIACAAG ^ 

I 

CAGCTAAAAGTCCnKTTAGACGCTAGTITnnXKTnTGGGa^T^ ^ 

U 

CGCriTCTAa;AAlClllX;iCCATACCAAGOCATITCCGCrGAATATCG < 

3801 + - + + -•- 2 

CarAAGATGCTrAGAACCAGCTATGGTrCGGTAAAGGCGACTTATAGC 

MuUiplc Cloning Site 

S B 
N Kg 
End M. rep e c pi 

t I n I 

I I II 

TrcrAGrn7ITGTGGrGGCATCXXnXXKXXXXXXXX}CGTAC^ 

AACATCACAACACCACCGTAGGCACCGCGCXGCCGOCATGCrrCTAGAA 
S 

Stop CoBdont p Begin Tnin9cripUon Tcrmlnnlor 

3 Frames I 
I 

GAaTTACrn'AACrAGCGTACGATCCATCGOCAGGCATCAAATAAAACG 

^'cTtKXniATTCAT^^ 
S 

S CL B 

r c c 

\ \ i FIO. ISC A 

CATCATOOOCXXXXTrGATCA 
4101 ♦ 

crAGTAOOGGoarAcrAcr 
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FIO. I3oB 



< 

u 



TCaXXTTCLACXXATGCATCGAGGCATIXXTATGAGaUCGGCTACAGa^A 

+ ■»■ + + +3300 

GCCCCAGTCGCriTACCTACCITXXTrAACGATACTtXXntXXXJATCTnX^ 



CGTGAC(XCOGCCGAAGGCGCrcGAATCAOCGGACTATC^^ 

+ + + + +3400 

GCAClXXKXXXXXXnrOCGCGAGCTrACntXXXTOATAGGCTItXXXn^ 



TGCACXXXXXXUACXXATOCGaXXTATCACGACXUOG^ 

+ ♦ + ••■*•- + 3SD0 

ACGTGCGGCGCnTXXTTACGCXXXXJATACfTt/TKXnxy^ 



0 (nCGGCrATCXXXX:GAGGAAAGAGCGTCXXXX:AGAACAGGAAGaXXnGA 

(Z *••• ••'::*•'•'• — *^ 

GAGCCXIATAGCXXXXntX'l-lUUlLXXLiOGOCOnLTlUlUrntSCaSAGT 

X <K;AGCGC€riXnXXXXXXXX7nXXGTGC(XXX7riXrGT^^ 

h; . + + + + +' +3700 

^ CCCTX;GCCCACAGOGa:CCCAAGGCACX;COCCAAGGCAACGTrGax:AGCCr 

1 

O GTCTCCTTGCGTGTTIXXnTGCGCXXXnTrrGAATACCAGCCAGACGAGAro 

I- . + + + + + +3800 

^ G\GAGCAACGCACAAAGCAAOG<XXXXj^AAACTrAT(K7nXXnXnXX^ 



GGGAGCnXIACCGCCAGAATCCGTGCriTGTGGrrGATtnACGTGGCGA^ 

. + + + + + +3900 

tXrTCGAGTGGCGGTCTTAGCCACCAACACXIACTACATCCACOCKnT^ 

H 
1 







E 




B 


P 


E 


R 






D 


X 


c 


N 




V 


Pc 


d 


C 


s 


r 


b 


o 


c 


m 


u 


so 


I 


1 








R 


o 


H 


I 


tR 


I 




1 


I 


I 


V 


I 


I 


I 


II 


I 


I 


I 



TAAATCTAGATATCCATGGATCCAGCTGCAGAATrCGAAGCTrATTXJATt^ 

. + + ♦ + -• + +4000 

AmAGATCTATAGCTACCTAGCTIXXJACGTCTrAAGCrrCGAATAGCTACAG 



AAAGGCrCAGTOGAAAGACTGGGCX:rilUii 11 1 A'lUlUllO-l TH/IUCGGC 

+ + + + + +4100 

TiXXXUCTX^GCTTTCrGACCCGGAAAGCAAAATAGACAACAAACAGGCCG 
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Fl O. IS B 



90 



AAGATAAAAATATATCATCATGAACAATAAAACTGTCTCCITACATAA 

+ + + ♦ + 100 

Tnn-ATrmATATAGTAGTACTIXTITATITrcACAGAOGAATGTATr 

GA(XXXXXX;ATrAAATIXX:AACATGCATtXnx;ATmTATGGGTATAA 

4. + + + + 200 

CTOXXX3Xn"AATrrAAGGTTCrACCrACGACTAAATATACCC\TATr 

GGCAAGOCCCATGCGCCAGACriTtTITIXnXJAAACATGGCAAAGCTAGC 

^ + ♦ + + 300 

ACCCnCGT^TACGCGGTCnXIAACAAAGACITrcrAaxnT^^ 

TITAT«XlXTIX:CGACCATCAAGCATITrATCan-ACI^^ 

^ + + •♦■ + 400 

ATACCGAGAACXXntXTrAGTIXXTrAAAATACGCATGAAGGAC ^^ 
AGAATATCCTCATICAGGTCAAAATATIXTrroATtKXXnXMX:Ao^ 

+ ♦ + + + 500 

TtnTATAGGACTAAClXXIACTmATAACAACrACGCGAOCG TCACAA 

CXrCTAlllLtilCTCGCTCAGGCCCAATCAOGAATGAATAACGGTrro 

. + 600 

GCXX^ATyUlAGC^ 

< TCIXXIAAAGAAATGCATAATCITritXXATItnXIAaXW 

o + + + — -V* - *^ ' • n* • • + 700 

~ GACC Cri IL ' ri lA CCTATrAGAAAACCGTAAGAgrGCCCrAAGTCACC 

to ATAGGlTXrrATrcATGTIXX;ACGAGTOCK;AATCGCAGA0CGATACCA 

_ + + + + + 800 

^ lTATCCAACATAACrACAACCIWnx:AGCCITAGOGTClXXX:^ 

GAAACGGCTITITCAAAAATATGGTATrGATAATCCrGATATGAATAAA 

X + + ♦ + + 900 

t TCTTKXXXIAAAAACnTmATACXIATAACTATrAGGACrATACrrA'nT 

^ GCTIXTTAACACrGGCAGAGCATrACCCTCACTrcAOGGGACGGCGGCT 

X + + ♦ •- + --- + 1000 

CX:AACATn7ir.AaxnCroCTAATGCGACIt;AACTGODC^ 

< COJACAACGCAGACOGTIXXXTIXXXIAAAGCAAAACITCAAAATCACX: 

2 + + + + +1100 

GGCTOTIXXXnCrGGCAAGGCACCO-ri'lWril^CAAGTnTA GTGG 

GCTXXUlXUTGGGCaUTrcAGGOnXXTrATGACraGCAACACOT 
oiACCTACTACOCCGCTAAGTCOGGAC^ 

GATCAAAGGATCnXTTCAGATOCTITnTIXntXrG^ ^ 

GAGCTACCAACTCTITITOCGAAGGTAACIXXXnTCAGCAGAGCG^ 

+ ..-..---• + -- -- •- -- - + ••*••• - -' + -•-'-*'** + 1400 
CTCGATWITGAGAAAAAGGCITOCATTCACr^ 

ACIXnxn-AGCAOTCCTACATACCIXXXnxnXKTAAT^^ ^ 
rcAGACATOCTCGCTCAT^ 
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no. ISA 

N 

I 

GCrAGCCAACAAAGOCGAO;!' J O'lVlC'ItAAAATCTCTGL^TtTTTACATrcCAC 

1 + + ♦ + +-•• 

CGATCGCTTCTTCGCTCCAACACAGACTITTAGAGACTACAATCr^^ 

ACACTAATACAACGGGTOITAIEAGaiATATrcAACaXXIAAAO^^ 

101 ♦ + + + 

TCnCATTATtnTCCCCACAATACTCGCTATAACTTGCCCTTIT^ 
N 

I 

201 ATX3GCCTO(XX;ATAAT<nXXXKX:AATt:AGGTCX:GACAATCTA7CGTrCTAT 

TACOCGACCG CTAm CAGCCCGTrAGritrACr.CT^ 

CTIWXAATtUTCTmCAGATCAGATWnCAGACrAAACIXXX^^ 

301 ♦ + + ^. 

CAAOCCmCTACAATGriCTACTCTAOCAGTCTtUTrrc 

CATtX?mCTCAOCACTGOGATOCCaXX;AAAACAGCATItX:AGCrATr^ 
401. •-••-•••■♦••-••-•.-• + •».•..... A.. ...... .4.. 

CrACCAATXUCIWrcACGCTAGGGCCCCTriTGTCGTAA<^^ 

ccitxxxxxx7mx:ATTcx;ATroc^^ 
501 + ♦ + + m 

CXUOOCGGCCAAOCTrAAGCTAAGGACAAACATrAACAGGAAAAritnt?^ f) 

ClTCu<TXXXUGTGATITIXUTCL*CGAGayrAAT 
«0h ♦ + * ♦ 

CAACTACGCIXL4CTAAAACTACrccroCCATACCCACCGGACAAmX7^ O 

^jjjJ^^J^^^'nTxntiAc^ - 

AGTCACTAOCACTAAAGAGTGAACTATTCK^T>0u>UC^^ ^ 

GGATCTyGCCATtXTATTX;AACTr,OCTCGGTCACTriTC^^ ^- 
801 ♦ + + ^ — 

CCrAGAACCCTAGGATACCTrcACGGAGCCACTCAAAAGACGAAGT/^^ ^ 

nf^Ar-mT'^T^.^™^ IS^N STOP CONDON x 

AACCTCAAAGTAAACTA(XAC(TA(^ < 

TItrnGAATAAATtX;AACTTm;CIGAGrrGAAr^.ATCAGATX:AC^^ 2 
1001 ♦ + + ^ 

AACAACTTATITAGCTrcAAAAOCACrCAACTrcCTAGriCTAGTCCCT^^ 

AACTCXTiraCCTACAACAAAGCTtnt^TCAACanXXXTrcccrc 
1101 + + + ^ 

END KANCASSETTEje.f: S BEGIN E.RAP 

CCItACGArKX:AGACXnCACTAGTTXXACTGAGanx:AGA(rCOCTAGAA^ 

12D1 + + + ♦ + 

GAAGTCCTCCXJICIOCACTCATCAAGGTCACTCGCAGTCTGGCGCATCTr^ 

CITGCAAA(:AAAAAAAOC:ACCGCrACCAGCr.GTr, (,l rU/i ri GOOGGATCAA 
1301 '* + + + 

GAAOTnTCITITITItXrrGGCGATGGlXXXXACCA^ 

ATACCAAATACTCTCCTIXn'AGTGTAGCCCTAGTTAGrxrACCACTTCAAGA 
1401 ♦ + + + 

TATCGTmTGACACOIAAGATCAOVTCGGCATCAATCCGGTCGTGAAGTrCT 



SUBSTITUTE SHEET 



wo 92/21376 



PCT/US92/04538 



(A 5 

(0 Z 
O Q 

^ o 

a. g 
o o 

cU 
o CD 

lo 

-t: ft: 

o 



Q. 

if) 

X 

i 

«> • 
.if 

ao 

xa. 

<n 

X 
o 
X 



0 



« 

o 

X 

E 
o 
m 

o 

jO 

X 



ID 



op — '^'''lOrOio 

Q-o— 0 = ~" 



CO 0,0— ^^-^ 



* CD 





O 

o 

o-^K ^ 

CD— — r^f^ ^ 0>rf) ^ 

tf,}-^^ 



OD 



CD 



SUBSTITUTE SHEET 



wo 92/21376 



PCr/US92/04538 



8 
i* 

U 
<l 

< 

o 
u 

< 
< 
o 

u 
I- 

< 
< 

< 

u 
o 

13 
o 

< 
u 
o 

o 
o 

< 
< 

u 
u 
o 
o 

!< 



6!S\ 



Q. 



d 

Q. 



d 

CO 

o . 

1 

-1 u 

5| 
c 

I 

CO 

q: 

0 
0 

3 

> ^ 



d 

00 



(0 

0 



< 
CL 

liJ 

to 

< 

CQ 

CO 

to 



> 
Q. 



> 
2 



U. 

o 

to 
o 

z 

to 

UJ 

to 

z 
o 

tO 
UJ 

a: 



o 
Z 



fO 
UJ 

to 
< 

CD 

q: 
o 

z 
o 
o 
o 
u 



<0 

u 

o 
I- 

I 
I- 

> 

<0 
UJ 

I 
I- 

_ \- ' 



to (0 

tJ X 



E 
d 

UJ CD 
X 

V- Q 
UJ 

-J 

to tc 
to ^ 

UJ 
UJ ^ 



o 
to 

Q. 

to 

X 

u_ 
O 



z 

o 

u 
u 

D 



D < 

O <o 

UJ Q 

< UJ UJ 

to Z Q 

UJ UJ ^ 

5 ^ < 

O to > 

TT- I 3 



I 
I- 



UJ 

< 
U. 



z 

< 

z 
g 

I- 

Q. 

tr 
u 
to 
z 



z 
o 

Q 
O 



Q. 

O 
I- 

09 

U 



o 

UJ 

Z 

-J 

a: 

UJ 

O 

z 

D 
O 

to 



h < 



UJ 
X 



U «0 UJ 

if^ CE < 

f < tj 

u «^ 

H Q ^ 

d ? z 

tD „ O 

to o 

UJ O 



u 



I- 



< 
UJ 

q: 



ql 

ILI 

I- o 



in 



C) < UJ 

UJ a 

CO K 



X 



0 



Q 

z 
< 

cr: 

Is 



to 
o 



UJ 

to 

o 
z 

z 
o 
_I 
(J 



-I 



D 

o 
I- 

X 

I- 

z 
> 

«o 



b 

•< 


CG 




o 


o 




b 




<*. 




u 


< 


<J 
1- 


TA 


i< 










< 








b 






U 


< 




o 


< 








< 

F 




1— 


< 


P 






u 


< 


^ 






< 


1- 


U 


u 


U 


o 




< 


% 


< 






o 




u 




o 




(J 


< 




< 






CD 


< 


U 


u 


O 


o 


U 




8 




< 


< 


< 




O 





< 
< 

< 
o 
u 
«- 
o 
< 

< 

< 
< 



< 
< 

u 

!i 

u 
o 
o 

< 

u 
u 

< 



< 



o 
u 

t5 

u 

tj 

o 
o 

u 

u 

o 

o 
u 
u 

t5 
o 

H 
U 

i3 



V5 

U 
< 



I- P 

I- u 

u o 

u < 

^ t5 



7 



SUBSTITUTE SHEET 



wo 92/21376 



PCr/US92/04538 



Bel I 4500 
Sfl 14483 
Sol I 4383 
Clal 4377 
HinD 111 4371 
EcoR I 4364 
• Pst I 4359 
•Pvu II 4356 
BomH I 4351 
Xbo I 3953 
Drol 3947 
•Bgl 113942 
•Kpnl3936 
Not I 3928 

HSP60 



Fl G. IS 



INhel- 



Bgl II 2797 




1219 Spe I 



2017 Not I 



1112 EcoRV. 
I BomH I- 824Ckil. 

■ 



3079 BomHi. 
3005 EcoR I 



BomH I locZ cossette 
3064 base pairs 



Digest w/BomH I 
Ligate w/BomHI LocZ cassette 
Transform E. col i 
Select blue colonies on X- gol Plates 



•Sol I 746! 
• Clo I 7455 
HinD III 7449 
EcoR I 7442 
■Pst I 7437 
BomH I 7429 , 
•EcoR I 7356 *Nhel- 
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1219 Spe I 



EcoR V 
2017 Not I 
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2797 Bgl II 



BamH I 4361 

Xbo I 3963 
Bgl II 3942 
Kpn I 3936 
Not I 3928 



FIG.

CAGCTCGCTGAGAGOOrrcAACBACAGCXX^GAACGCCAGCOCGO^ 

+ + + + 100 

GGlXX;AGOCACTCTCXXXL\CTIlXTC7nXXGCTI^ 

< IUnriTCACTGC\CCAG€TTXLAATTnXX7IX7rG^ 

+ + + + 200

(\j GAGAAGItJACGTGGTCGAGGTrAGACXIACACTrACGGGGAGCAGACAAC 
.GGGATGCGTrGC\ACCGOGTATGCCCAGGTCAGAAGAGTXXK:ACA^ 

^ + + + + + 300 

U. TCCCrACGCAACGTIXXKXX:ATACGGClXXL\GriXnTXnx:ACCGT^^ 

I <iGTC(XTrrrC.rrc^^ 40o 



att P core



X CrcGGCTGCATCCnxn'AAGIXKl^GAAATIXXl\GGTCGTAG^ 

O + + -- ■«• + + 500 

J- AGAGCCGACGTAGGAGATIXLlC CiritU ' iU 'AAOGTCCAGCA' iL 'i it 'COGC 

< 

2 GAGAGGAGACXn-AGTTCGCAACXTnXXXXUTGGGGATOGCTGAAGACn: 

GCrCTCCIXnXKJATCAAarGTGCAGCGCCrAOCCCTAGO^ ^ 
Pail 
I 

GGTACn^ACGCGCTCCAGACCrACGACAACAAC ATT^nr^r.A a nrrrr. 

+ + + + + 700 

CCATGATGCGCGACGTXnXXiATGCIXTnXTITCTACCTGiXGCIT^^ 
YY AL QT YD NKMDA EAW 

Int start?

ACCGGGCGAAGAAGGCAGCCGCCAGCGCCATCACGCTGGAGGAGTAC 
+ + + + + 800 

CCTXXKTCGCrrCTrcCGTCGGCGCTrcGCCGTAGTXXXJACCTCCIXjVTC 
D RAKKAAA S A IT LEE Y 

TACAGCGGGCACGOGGAGCGCCGCATCTA(XCCCrrGCrAGCrrGAACTGG
+ + + + + 900

ATXrn:GCXXXrrnCGCCIt:GCGG<X;TAGATGGGCCAOGATCCACIT^ 

YSGHAERRIYPVLCEVA

GTAGGAAGCACCCGACIXXrcrGCCGGCATnCCTACAACXTtXTCCCG*^ 

+ + + + + 1000 

ATOCTrcGTCGC/rrcACGGGCGr/rOGTACGGATGTIXXIAGGAGGCCO^ 
RKHPTA R RHA YNV LRA 

ATCGAGCAGAAGGCAGCCGATGAGCGCGAOGTAGAGGCGCrGAOGCCr 
+ + + + + 1100 

tagctcgtcttccgtx:goctacixxx:gctgcatctccgcgact^^ 
I eqkaa der dve a ltp 



CATACATCCrrCGCGTt^GACGAGCCTCCXXJTItXXJACAGCTXlATCGACC 
+ + + + + 1200

gtatcttaggaccgcacctccrcggaggccaagccnxntxjactagctc^ 
yilawtslrfgeliel 

gorgtgccgcrtcccgcxnxxxxjaaougatcgtcxtrtcgcaac^^ 

* + * + + 1300 

CGGCACTGCGAAGGGOGCACCLL'liti'llCTAGCAGCAACCGriTGCGGTr 
RGAS RVGNKIVVGN AK

FIG.

J GTCGACCACCAAGGGCAOCATCTCrGClTGGC-,CCACCCCCrriXX;^ 
CAGClXK7rCGTTCCa7IXXrrACAGACXJAACCCGGTG<XXX:AA 

gcgagggtixxgaccgcixk:aactcccggtccaaccttgtcccggtctat 

CGCrcOCAAGGCTC«X;A(XTrcAGGGCCACGllXX;AACAGG(^ CD 

GCGCAGGCXXKXXXXnurATnXTITGTCAGCATXXJAAAGTAGOCAGATCA ^, 
201 + + + + + ^ 

CGCGTCCCCCCCCOGAGATAAGCAAACAGTCGTAGCTITCATCGCTCTAG O 

301 ?7^p.^55^^^AAAGAAAAATGGCCAGAGCGOGAAAACACOT ^ 

AACGTCTCGGGACcnitj^ -- 

Nde I I 

I - 

401 TGGGTGTClGCCGACCACATATGGGCCGGTCAAGATAGGTTTTTACCCCCr ^ 

A(XACAGACGGCTGGTGTATACOCGGCCAGTrCTATCCAAAAAATGGCGG ^ 
501 TTCAAGOCTCAGACTITGCACAGGAGTlXXIAACXCGGTAGanTCTn^ ^ 

AACnTCGGACTCTCAACGrcTCCTXTAACG^^ 5 
BamH I
I 

AGCGCACCGGGAGGATCCAACCCrcATAOCTCAACCCGCAGOACCGTmnA 

601 + + + ^. ^. 

TCXKX;TCG<XCTCCrAGGTTCGGACTATGCAGTrGGGCGTCCTGCCACACT 
Im V ^ 

Int start?

CGCGGGCGAGAACCGGCTCATCGAGATGGAGACCTGGACCCClXrCACAGG 
701 + + + + + . 

CGAGCGCC(XKnxnTCGOCGACTAGClXTACClCIXX>ACCTCGCGACGTt7r 
Int LAGEKRLIEMETWT PPQ 

ACaTGGAAGTGGCIXXTIXXSAGCGCGACCrCGCACACGCCAOCACGGATaX? 
801 + + + + +

TGGGCCTTCACCGAGCACCTCGCGCTGGAGCGTXTITX:^^ 

TRKWLVERDLADCTRDL 

CGCTCACAGAGATGACGCCAGCTCTGGTGCGTTXrGTGGrrGCGCaXKM 

901 + + + +

GCCAGTGTCICTACTGCXKTItXJAGACXIACOC^CGCACCACCCCGCCCTACXX: 
VTEMTPA LVRAWWAGMG 

GCTGATGAACACAGCGGTCGAGGACAAGCTGATCXXrAGAGAACCaTrGCCGG 
1001 + + + +

CCACTACITGTCTCGCCAGCItXTCTIXTGACTACCtriXTI^^ 

VMNTA VEDKL IAENPCR

Bgl II
I 

GATCAGCTCGACATCXnrGCCGCrcAGATCTTCGAGCACTACCGGATCC^ 
1101 + + + + + 

CTCCTOGACCnt7rAGCAGCGGCCACTCTAGAAGCrCGTGATr,GCCTAGCG^ 
EELDIVAAEIFEHYRIAA 

TIXr»CCGCAAGGACATCGTXX;ACGACGGCATfiAOGATGAAGCrCCGGGTGC 
1201 + + + + +

AAGCGGCGTItXTCTAGCACCTGCTGCCGTACTtXn'ACTTCGAr.G<XC^ 
R RKD I VDD GMT M K L R V R

EcoO109 I 3148
Aat II 3089

FIG. 33

Xmn I 2768
Sca I 2640

AlwN I 1689

1278 Afl III



FIG. 2[?]

Aat II 9779

HinD III 7609
Sph I 7603
Xba I 7585
BamH I 7579

Apa I 7387
Nde I 7382

404 Dra III
510 Nae I

707 Bbe I
707 Nar I

868 EcoR I
874 Sac I
880 Asp718

880 Kpn I
889 BamH I
896 Xba I
901 Acc I
901 HinC II
901 Sal I
906 BspM I

907 Pst I
913 Sph I

919 HinD III

868 EcoR I
707 Nar I
874 Sac I
880 Asp718
880 Kpn I

884 Sma I
884 Xma I
889 BamH I

1186 Sa[?] I
[?]00 Stu I

Cla I 5519
Nde I 5463

4605 SnaB I
Xmn I 4658
Sca I 4541



FIG. 2[?]

AlwN I 3581

Afl III 3170

HinD III 2811
Pst I 2799

Xba I 2787
BamH I 2781
SHADED PORTION = L5 DNA

404 Dra III

707 Bbe I

707 Nar I

868 EcoR I
874 Sac I
880 Asp718
880 Kpn I

884 Xma I
889 BamH I
939 Pst I

1057 BstX I

1374 Bsu36 I
1386 Tth111 I

1650 Spl I
1657 Apa I
1677 Bst[?] I
1685 BstE II
1764 Eag I
1869 Avr II
1942 Bal I



Xmn I 7608

FIG.

404 Dra III

AlwN I 6531

HinD III 5761

Xba I 5737
BamH I 5731

pMH-4

8000 BASE PAIRS
UNIQUE SITES

707 Bbe I
707 Nar I

1727 Nde I
1782 Cla I

2302 Nr[?] I

Avr II 4819
Apa I 46[?]7
Spl I 4600
Bsu36 I 4324

2798 B[?] II
3044 Sal I
3226 S[?]

3839 BamH I

SHADED PORTION = L5 DNA

FIG. 2[?]

Xmn I 8569

AlwN I 7492

HinD III 672
PflM I 6534

Xma I 6149
Sma I 6149
Xho I 5875

PaeR7 I 5875